(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau




(43) International Publication Date: 14 February 2002 (14.02.2002)

(10) International Publication Number: WO 02/12340 A2



(51) International Patent Classification⁷: C07K 14/705

(21) International Application Number: PCT/US01/24217

(22) International Filing Date: 1 August 2001 (01.08.2001)

(25) Filing Language: English

(26) Publication Language: English



(30) Priority Data:

60/223,269    3 August 2000 (03.08.2000)    US
60/224,456    10 August 2000 (10.08.2000)    US
60/226,410    18 August 2000 (18.08.2000)    US
60/228,140    25 August 2000 (25.08.2000)    US
60/230,067    31 August 2000 (31.08.2000)    US
60/231,434    8 September 2000 (08.09.2000)    US



(71) Applicant (for all designated States except US): INCYTE
GENOMICS, INC. [US/US]; 3160 Porter Drive, Palo
Alto, CA 94304 (US).

(72) Inventors; and

(75) Inventors/Applicants (for US only): YUE, Henry
[US/US]; 826 Lois Avenue, Sunnyvale, CA 94087 (US).
THORNTON, Michael [US/US]; 9 Medway Road,
Woodside, CA 94062 (US). RAMKUMAR, Jayalaxmi
[IN/US]; 34359 Maybird Circle, Fremont, CA 94555
(US). TANG, Y. Tom [US/US]; 4230 Ranwick Court,
San Jose, CA 95118 (US). AZIMZAI, Yalda [US/US];
5518 Boulder Canyon Drive, Castro Valley, CA 94522
(US). BAUGHN, Marian, R. [US/US]; 14244 Santiago
Road, San Leandro, CA 94577 (US). YANG, Junming
[CN/US]; 7125 Bank Lane, San Jose, CA 95129 (US).
YAO, Monique, G. [US/US]; 111 Frederick Court, Mountain
View, CA 94043 (US). LAL, Preeti [IN/US]; P.O. Box
5142, Santa Clara, CA 95056 (US). WALIA, Narinder, K.
[US/US]; 890 Roble Avenue #1, San Leandro, CA 94577
(US). GANDHI, Ameena, R. [US/US]; 837 Roble Avenue,
#1, Menlo Park, CA 94025 (US). HAFALIA, April,
J., A. [US/US]; 2227 Calle de Primavera, Santa Clara,
CA 95054 (US). NGUYEN, Danniel, B. [US/US]; 1403
Ridgewood Drive, San Jose, CA 95118 (US). PATTERSON,
Chandrea [US/US]; 490 Sherwood Way #1, Menlo
Park, CA 94025 (US). ELLIOTT, Vicki, S. [US/US];
3770 Polten Place Way, San Jose, CA 95121 (US).
TRIBOULEY, Catherine, M. [FR/US]; 1121 Tennessee,
#5, San Francisco, CA 94107 (US). LU, Dyung, Aina,
M. [US/US]; 233 Coy Drive, San Jose, CA 95123 (US).
XU, Yuming [US/US]; 1739 Walnut Drive, Mountain
View, CA 94040 (US). REDDY, Roopa [IN/US]; 1233 W.
McKinley Avenue #3, Sunnyvale, CA 94086 (US).
HERNANDEZ, Roberto [GB/GB]; 160 Pine Tree Avenue,
Canterbury Kent, CT2 7SZ (GB). BOROWSKY, Mark,
L. [US/US]; 122 Orchard Avenue, Redwood City, CA
94061 (US). LO, Terence, P. [CA/US]; 1451 Beach Park
Boulevard, Apt 115, Foster City, CA 94404 (US). LU,
Yan [CN/US]; 3885 Corrina Way, Palo Alto, CA 94303
(US). POLICKY, Jennifer, L. [US/US]; 1511 Jarvis
Court, San Jose, CA 95118 (US). GREENE, Barrie,
D. [US/US]; 1332 10th Avenue #104, San Francisco, CA
94122 (US). SANJANWALA, Madhu, S. [US/US]; 210
Sylvia Court, Los Altos, CA 94024 (US). RAUMANN,
Brigitte, E. [US/US]; 5801 South Dorchester Avenue #3B,
Chicago, IL 60637 (US). BURFORD, Neil [GB/US]; 105
Wildwood Circle, Durham, CT 06422 (US). ISON, Craig,
H. [US/US]; 1242 Weathersfield Way, San Jose, CA 95118
(US). LEE, Ernestine, A. [US/US]; 624 Kains Street,
Albany, CA 94706 (US). DING, Li [CN/US]; 3353 Alma
Street #146, Palo Alto, CA 94306 (US). DAS, Debopriya
[IN/US]; 1179 Bonita Avenue Apt. 3, Mountain View,
CA 94040 (US). KALLICK, Deborah, A. [US/US];
900 Olive Street, Menlo Park, CA 94025 (US). KHAN,
Farrah, A. [IN/US]; 3617 Central Road #102, Glenview,
IL 60025 (US). SEILHAMER, Jeffrey, J. [US/US];
12555 La Cresta, Los Altos, CA 94022 (US).

(74) Agents: HAMLET-COX, Diana et al.; Incyte Pharma- 
ceuticals, Inc., 3160 Porter Drive, Palo Alto, CA 94304 
(US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TD, 
TG). 




(54) Title: TRANSPORTERS AND ION CHANNELS

(57) Abstract: The invention provides human transporters and ion channels (TRICH) and polynucleotides which identify and
encode TRICH. The invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention also
provides methods for diagnosing, treating, or preventing disorders associated with aberrant expression of TRICH.






Published:
- without international search report and to be republished upon receipt of that report

For two-letter codes and other abbreviations, refer to the "Guidance Notes on Codes and Abbreviations" appearing at the
beginning of each regular issue of the PCT Gazette.






TRANSPORTERS AND ION CHANNELS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of transporters and ion 
channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport,
neurological, muscle, immunological, and cell proliferative disorders, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of 
transporters and ion channels. 

BACKGROUND OF THE INVENTION

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells 
and organelles require transport proteins to import and export essential nutrients and metal ions 
including K⁺, NH₄⁺, Pᵢ, SO₄²⁻, sugars, and vitamins, as well as various metabolic waste products.
Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic
neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse cell 
functions (Griffith, J. and C. Sansom (1998) The Transporter Facts Book . Academic Press, San Diego 
CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can be 
linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in
transport include carrier proteins, which bind to a specific solute and undergo a conformational
change that translocates the bound solute across the membrane, and channel proteins, which form 
hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient. 

Carrier proteins which transport a single solute from one side of the membrane to the other
are called uniporters. In contrast, coupled transporters link the transfer of one solute with
simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the
opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous
Na⁺/K⁺ ATPase system. Sodium-coupled transporters include the mammalian glucose transporter
(SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have
twelve putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically-
oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of
various thyroid pathologies because it is the molecular basis for radioiodide thyroid-imaging
techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997)
Proc. Natl. Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and
placenta, and is implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate
(Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506).

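The distinction drawn above between uniport, symport, and antiport depends only on how many solutes a carrier moves per cycle and whether they move in the same direction. The following Python sketch illustrates that rule. The names `Mode` and `classify_carrier`, and the encoding of directions as the strings "in" and "out", are assumptions made for illustration and do not appear in the source.

```python
from enum import Enum


class Mode(Enum):
    UNIPORT = "uniport"
    SYMPORT = "symport"
    ANTIPORT = "antiport"


def classify_carrier(directions):
    """Classify a carrier from the direction of each solute it moves per cycle.

    `directions` is a list with one entry per transported solute, each entry
    being "in" or "out" (a hypothetical encoding chosen for this sketch).
    """
    if not directions or any(d not in ("in", "out") for d in directions):
        raise ValueError("each direction must be 'in' or 'out'")
    if len(directions) == 1:
        # A single solute carried across the membrane: uniporter.
        return Mode.UNIPORT
    # Coupled transport: same direction is symport, opposite is antiport.
    return Mode.SYMPORT if len(set(directions)) == 1 else Mode.ANTIPORT


# SGLT1 as described in the text: Na+ and glucose both enter the cell.
sglt1_mode = classify_carrier(["in", "in"])
```

Under this encoding, a sodium-coupled transporter such as SGLT1 is classified as a symporter, whereas a carrier that moves one solute in and another out would be classified as an antiporter.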
One of the largest families of transporters is the major facilitator superfamily (MFS), also
called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers
that transport small solutes in response to ion gradients. Members of the MFS are found in all classes
of living organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates,
nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have a structure
comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. Rev. 62:1-34).
The largest family of MFS transporters is the sugar transporter family, which includes the seven
glucose transporters (GLUT1-GLUT7) found in humans that are required for the transport of glucose
and other hexose sugars. These glucose transport proteins have unique tissue distributions and
physiological functions. GLUT1 provides many cell types with their basal glucose requirements and
transports glucose across epithelial and endothelial barrier tissues; GLUT2 facilitates glucose uptake
or efflux from the liver; GLUT3 regulates glucose supply to neurons; GLUT4 is responsible for
insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake into skeletal muscle.
Defects in glucose transporters are involved in a recently identified neurological syndrome causing
infantile seizures and developmental delay, as well as glycogen storage disease, Fanconi-Bickel
syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, M. (1994) Eur. J. Biochem.
219:713-725; Longo, N. and L.J. Elsas (1998) Adv. Pediatr. 45:293-313).

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are
predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop between
TM6 and TM7, and play a critical role in maintaining intracellular pH by removing the protons that
are produced stoichiometrically with lactate during glycolysis. The best characterized
H⁺-monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and a
wide range of other aliphatic monocarboxylates. Other cells possess H⁺-linked monocarboxylate
transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and
tumor cells have transporters that differ in their Kₘ values for certain substrates, including
stereoselectivity for L- over D-lactate, and in their sensitivity to inhibitors. There are
Na⁺-monocarboxylate cotransporters on the luminal surface of intestinal and kidney epithelia, which
allow the uptake of lactate, pyruvate, and ketone bodies in these tissues. In addition, there are
specific and selective transporters for organic cations and organic anions in organs including the
kidney, intestine and liver. Organic anion transporters are selective for hydrophobic, charged
molecules with electron-attracting side groups. Organic cation transporters, such as the ammonium
transporter, mediate the secretion of a variety of drugs and endogenous metabolites, and contribute to
the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am. J. Physiol.
264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and L.
Haggstrom (1993) J. Biotechnol. 30:339-350).

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids,
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC
transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP
to supply the energy required for transport, and two membrane-spanning domains (MSD), each
containing six putative transmembrane segments. These four modules may be encoded by a single
gene, as is the case for the cystic fibrosis transmembrane conductance regulator (CFTR), or by separate genes.
When encoded by separate genes, each gene product contains a single NBD and MSD. These
"half-molecules" form homo- and heterodimers, such as Tap1 and Tap2, the endoplasmic reticulum-based
major histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic
fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP),
Zellweger syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic
hypoglycemia (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR)
protein, another ABC transporter, in human cancer cells makes the cells resistant to a variety of
cytotoxic drugs used in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol.
292:130-162).
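The four-module organization described above can be expressed as a simple bookkeeping check: a functional ABC transporter needs two NBDs and two MSDs in total, whether they come from one gene product or from a dimer of half-molecules. The Python sketch below is illustrative only. The names `GeneProduct` and `is_complete_abc_transporter` are hypothetical, and the module counts assigned to each example follow the description in the text rather than any sequence database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GeneProduct:
    name: str
    nbd: int  # number of nucleotide-binding domains in this polypeptide
    msd: int  # number of membrane-spanning domains in this polypeptide


TM_SEGMENTS_PER_MSD = 6  # "six putative transmembrane segments" per MSD


def is_complete_abc_transporter(products):
    """Return True if the assembled products supply two NBDs and two MSDs."""
    total_nbd = sum(p.nbd for p in products)
    total_msd = sum(p.msd for p in products)
    return total_nbd == 2 and total_msd == 2


def transmembrane_segments(products):
    """Total putative transmembrane segments across all MSDs."""
    return TM_SEGMENTS_PER_MSD * sum(p.msd for p in products)


# Single-gene case (all four modules in one polypeptide).
cftr = GeneProduct("CFTR", nbd=2, msd=2)
# Half-molecules (one NBD and one MSD each) that pair as a heterodimer.
tap1 = GeneProduct("Tap1", nbd=1, msd=1)
tap2 = GeneProduct("Tap2", nbd=1, msd=1)
```

With these definitions, CFTR alone and the Tap1/Tap2 pair both satisfy the four-module requirement, while a single half-molecule does not.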

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example,
copper is involved in hemoglobin synthesis, connective tissue metabolism, and bone development, by 
acting as a cofactor in oxidoreductases such as superoxide dismutase, ferroxidase (ceruloplasmin), 
and lysyl oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by 
transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and
other target organs, where specific transporters move the ions into cells and cellular organelles as
needed. Imbalances in metal ion metabolism have been associated with a number of disease states 
(Danks, D.M. (1986) J. Med. Genet. 23:99-106). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 
acid transport appears to occur via a high affinity, low capacity protein-mediated transport process.
Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane
segments, is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as 
muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose 
conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et
al. (1998) J. Biol. Chem. 273:27420-27429).

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP,
ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the pyruvate
carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the
tricarboxylate carrier which transports citrate and malate; and the Graves' disease carrier protein, a
protein recognized by IgG in patients with active Graves' disease, an autoimmune disorder resulting
in hyperthyroidism. Proteins in this family consist of three tandem repeats of an approximately 100
amino acid domain, each of which contains two transmembrane regions (Stryer, L. (1995)
Biochemistry, W.H. Freeman and Company, New York NY, p. 551; PROSITE PDOC00189
Mitochondrial energy transfer proteins signature; Online Mendelian Inheritance in Man (OMIM)
275000 Graves Disease).

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been
proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al.
(1999) J. Int. Med. 245:637-642).
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form ion-
selective pores within the membrane. There are two basic types of ion channels, ion transporters and 
gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an
ion down the ion's electrochemical gradient under restricted conditions. Together, these types of ion
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical
impulse conduction down the axon of a nerve cell, 2) transport of molecules into cells against 
concentration gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient.
These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion
transporters, including Na⁺-K⁺ ATPase, Ca²⁺-ATPase, and H⁺-ATPase, are activated by a
phosphorylation event. P-class ion transporters are responsible for maintaining resting potential
distributions such that cytosolic concentrations of Na⁺ and Ca²⁺ are low and cytosolic concentration
of K⁺ is high. The vacuolar (V) class of ion transporters includes H⁺ pumps on intracellular
organelles, such as lysosomes and Golgi. V-class ion transporters are responsible for generating the
low pH within the lumen of these organelles that is required for function. The coupling factor (F)
class consists of H⁺ pumps in the mitochondria. F-class ion transporters utilize a proton gradient to
generate ATP from ADP and inorganic phosphate (Pᵢ).

The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and
several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) Curr.
Opin. Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the V₁
domain, a peripheral complex responsible for ATP hydrolysis; and the V₀ domain, an integral
complex responsible for proton translocation across the membrane. The F-ATPases are structurally
and evolutionarily related to the V-ATPases. The F-ATPase F₀ domain contains 12 copies of the c
subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a
single buried carboxyl group in TM2 that is essential for proton transport. The V-ATPase V₀ domain
contains three types of homologous c subunits with four or five transmembrane domains and the
essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a subunit that
may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. Biol. Chem.
274:12951-12954).

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 
the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport
(symport) so that the movement of Na⁺ down an electrochemical gradient drives transport of the other
molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca²⁺ out of the cell
with transport of Na⁺ into the cell (antiport).
Gated Ion Channels

Gated ion channels control ion flow by regulating the opening and closing of pores. The
ability to control ion flux through various gating mechanisms allows ion channels to mediate such
diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle contraction,
fertilization, and regulation of ion and pH balance. Gated ion channels are categorized according to
the manner of regulating the gating function. Mechanically-gated channels open their pores in
response to mechanical stress; voltage-gated channels (e.g., Na⁺, K⁺, Ca²⁺, and Cl⁻ channels) open
their pores in response to changes in membrane potential; and ligand-gated channels (e.g.,
acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter.
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and
closing) are sometimes modulated by association with auxiliary channel proteins and/or
post-translational modifications, such as phosphorylation.

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently cloned 
from rat kidney. The SIC channel belongs to a group of channels which are activated by pressure or 
stress on the cell membrane and conduct both Ca²⁺ and Na⁺ (Suzuki, M. et al. (1999) J. Biol. Chem.
274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and
carboxy termini. In the Na⁺ and Ca²⁺ subfamilies, this domain is repeated four times, while in the K⁺
channel subfamily, each channel is formed from a tetramer of either identical or dissimilar subunits.
The P region contains information specifying the ion selectivity for the channel. In the case of K⁺
channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc. Natl. Acad. 
Sci. USA 94:11651-11656). 

Voltage-gated Na⁺ and K⁺ channels are necessary for the function of electrically excitable
cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na⁺
and K⁺ ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na⁺
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more
voltage-gated Na⁺ channels, which propagates the depolarization down the length of the cell.
Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged
residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts only
about 1 millisecond, at which time the channel spontaneously converts into an inactive state that
cannot be opened irrespective of the membrane potential. Inactivation is mediated by the channel's
N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a closed state
requires a return to resting potential.
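The closed, open, and inactive states described above form a small cycle that can be sketched as a state machine. The Python example below is a deliberately simplified illustration, not a biophysical model. The threshold of -55 mV and the resting potential of -70 mV are assumed textbook-style values that do not appear in the source, the 1 ms open time is the approximate figure given in the text, and the class and constant names are hypothetical.

```python
from enum import Enum, auto


class State(Enum):
    CLOSED = auto()
    OPEN = auto()
    INACTIVATED = auto()


THRESHOLD_MV = -55.0    # assumed threshold for opening (not from the source)
RESTING_MV = -70.0      # assumed resting potential (not from the source)
OPEN_DURATION_MS = 1.0  # "about 1 millisecond" per the text


class VoltageGatedChannel:
    """Toy three-state model of a voltage-gated channel."""

    def __init__(self):
        self.state = State.CLOSED
        self.open_ms = 0.0

    def step(self, membrane_mv, dt_ms):
        """Advance the channel by dt_ms at the given membrane potential."""
        if self.state is State.CLOSED:
            # Depolarization beyond threshold opens a closed channel.
            if membrane_mv > THRESHOLD_MV:
                self.state = State.OPEN
                self.open_ms = 0.0
        elif self.state is State.OPEN:
            # The open state converts spontaneously to the inactive state.
            self.open_ms += dt_ms
            if self.open_ms >= OPEN_DURATION_MS:
                self.state = State.INACTIVATED
        elif self.state is State.INACTIVATED:
            # An inactive channel cannot reopen; it returns to the closed
            # state only once the membrane is back at resting potential.
            if membrane_mv <= RESTING_MV:
                self.state = State.CLOSED
        return self.state
```

In this sketch, a channel held at a depolarized potential opens, inactivates after the open time elapses, and stays inactivated no matter how strongly the membrane is depolarized until the potential returns to the assumed resting value.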

Voltage-gated Na⁺ channels are heterotrimeric complexes composed of a 260 kDa pore-
forming α subunit that associates with two smaller auxiliary subunits, β1 and β2. The β2 subunit is an
integral membrane glycoprotein that contains an extracellular Ig domain, and its association with α
and β1 subunits correlates with increased functional expression of the channel, a change in its gating
properties, as well as an increase in whole cell capacitance due to an increase in membrane surface
area (Isom, L.L. et al. (1995) Cell 83:433-442).

Non-voltage-gated Na⁺ channels include the members of the amiloride-sensitive Na⁺
channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini
located within the cell. The NaC/DEG family includes the epithelial Na⁺ channel (ENaC) involved in
Na⁺ reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the kidney,
and exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 and Liddle's
syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently
characterized H⁺-gated cation channels or acid-sensing ion channels (ASIC). ASIC subunits are
expressed in the brain and form heteromultimeric Na⁺-permeable channels. These channels require
acid pH fluctuations for activation. ASIC subunits show homology to the degenerins, a family of
mechanically-gated channels originally isolated from C. elegans. Mutations in the degenerins cause
neurodegeneration. ASIC subunits may also have a role in neuronal function, or in pain perception,
since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski (1998) Curr. Opin. Neurobiol.
8:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337-342).

K⁺ channels are located in all cell types, and may be regulated by voltage, ATP
concentration, or second messengers such as Ca²⁺ and cAMP. In non-excitable tissue, K⁺ channels
are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action
potentials and repolarizing membranes, K⁺ channels are responsible for setting resting membrane
potential. The cytosol contains non-diffusible anions and, to balance this net negative charge, the cell
contains a Na⁺-K⁺ pump and ion channels that provide the redistribution of Na⁺, K⁺, and Cl⁻. The
pump actively transports Na⁺ out of the cell and K⁺ into the cell in a 3:2 ratio. Ion channels in the
plasma membrane allow K⁺ and Cl⁻ to flow by passive diffusion. Because of the high negative charge
within the cytosol, Cl⁻ flows out of the cell. The flow of K⁺ is balanced by an electromotive force
pulling K⁺ into the cell, and a K⁺ concentration gradient pushing K⁺ out of the cell. Thus, the resting
membrane potential is primarily regulated by K⁺ flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489-492).
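The balance described above, between the electrical force pulling K⁺ into the cell and the concentration gradient pushing it out, is conventionally quantified with the Nernst equation, which the text itself does not state. The Python sketch below computes the equilibrium potential for a single ion and the net charge exported by the 3:2 pump. The ion concentrations (5 mM outside and 140 mM inside for K⁺, 145 mM outside and 12 mM inside for Na⁺) and the temperature of 37 °C are assumed, textbook-typical values used only for illustration, and the function names are hypothetical.

```python
import math

R = 8.314462618   # gas constant, J/(mol*K)
F = 96485.33212   # Faraday constant, C/mol


def nernst_mv(z, conc_out_mm, conc_in_mm, temp_c=37.0):
    """Equilibrium (Nernst) potential in millivolts, inside relative to outside.

    E = (R*T)/(z*F) * ln([ion]_out / [ion]_in)
    """
    temp_k = temp_c + 273.15
    return 1000.0 * (R * temp_k) / (z * F) * math.log(conc_out_mm / conc_in_mm)


def pump_net_charges_exported(cycles, na_out_per_cycle=3, k_in_per_cycle=2):
    """Net positive charges moved out of the cell by the Na+-K+ pump.

    Uses the 3:2 ratio given in the text: three Na+ out for two K+ in.
    """
    return cycles * (na_out_per_cycle - k_in_per_cycle)


# Assumed illustrative concentrations in mM (not taken from the source).
e_k = nernst_mv(z=1, conc_out_mm=5.0, conc_in_mm=140.0)
e_na = nernst_mv(z=1, conc_out_mm=145.0, conc_in_mm=12.0)
```

Under these assumed concentrations the K⁺ equilibrium potential is strongly negative and the Na⁺ equilibrium potential is positive, which is consistent with the statement that a membrane permeable mainly to K⁺ at rest has a negative resting potential.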

Potassium channel subunits of the Shaker-like superfamily all have the characteristic six
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to form
functional K⁺ channels. These pore-forming subunits also associate with various cytoplasmic β
subunits that alter channel inactivation kinetics. The Shaker-like channel family includes the voltage-
gated K⁺ channels as well as the delayed rectifier type channels such as the human ether-a-go-go
related gene (HERG) associated with long QT, a cardiac dysrhythmia syndrome (Curran, M.E. (1998)
Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin. Chem.
Biol. 3:448-458).

A second superfamily of K⁺ channels is composed of the inward rectifying channels (Kir).
Kir channels have the property of preferentially conducting K⁺ currents in the inward direction.
These proteins consist of a single potassium selective pore domain and two transmembrane domains,
which correspond to the fifth and sixth transmembrane domains of voltage-gated K⁺ channels. Kir
subunits also associate as tetramers. The Kir family includes ROMK1, mutations in which lead to
Bartter syndrome, a renal tubular disorder. Kir channels are also involved in regulation of cardiac
pacemaker activity, seizures and epilepsy, and insulin regulation (Doupnik, C.A. et al. (1995) Curr.
Opin. Neurobiol. 5:268-277; Curran, supra).

The recently recognized TWIK K⁺ channel family includes the mammalian TWIK-1, TREK-1
and TASK proteins. Members of this family possess an overall structure with four transmembrane
domains and two P domains. These proteins are probably involved in controlling the resting potential
in a large set of cell types (Duprat, F. et al. (1997) EMBO J. 16:5464-5471).

The voltage-gated Ca²⁺ channels have been classified into several subtypes based upon their
electrophysiological and pharmacological characteristics. L-type Ca²⁺ channels are predominantly
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction
coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type
channels are involved in the control of neurotransmitter release in the central and peripheral nervous
system. The L-type and N-type voltage-gated Ca²⁺ channels have been purified and, though their
functions differ dramatically, they have similar subunit compositions. The channels are composed of
three subunits. The α1 subunit forms the membrane pore and voltage sensor, while the α2δ and β
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the
channel. These subunits are encoded by at least six α1, one α2δ, and four β genes. A fourth subunit, γ,
has been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367;
McCleskey, E.W. (1994) Curr. Opin. Neurobiol. 4:304-312).

The transient receptor family (Trp) of calcium ion channels are thought to mediate
capacitative calcium entry (CCE). CCE is the Ca²⁺ influx into cells to resupply Ca²⁺ stores depleted
by the action of inositol triphosphate (IP3) and other agents in response to numerous hormones and
growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to voltage-
gated Ca²⁺ channels in the S3 through S6 regions. This suggests that Trp and/or related proteins may
form mammalian CCE channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997)
J. Biol. Chem. 272:29672-29680). Melastatin is a gene isolated in both the mouse and human, and
whose expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo.
The human cDNA transcript corresponds to a 1533-amino acid protein having homology to members
of the Trp family. It has been proposed that the combined use of melastatin mRNA expression status
and tumor thickness might allow for the determination of subgroups of patients at both low and high
risk for developing metastatic disease (Duncan, L.M. et al. (2001) J. Clin. Oncol. 19:568-576).

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and
organelle pH. In secretory epithelial cells, Cl⁻ enters the cell across a basolateral membrane through
an Na⁺,K⁺/Cl⁻ cotransporter, accumulating in the cell above its electrochemical equilibrium
concentration. Secretion of Cl⁻ from the apical surface, in response to hormonal stimulation, leads to
flow of Na⁺ and water into the secretory lumen. The cystic fibrosis transmembrane conductance
regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal
genetic disorder in humans. CFTR is a member of the ABC transporter family, and is composed of
two domains each consisting of six transmembrane domains followed by a nucleotide-binding site.
Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The
resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and devastating
"chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266).

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane
domains, as well as two small globular domains known as CBS domains. The CLC subunits
probably function as homotetramers. CLC proteins are involved in regulation of cell volume,
membrane potential stabilization, signal transduction, and transepithelial transport. Mutations in
CLC-1, expressed predominantly in skeletal muscle, are responsible for autosomal recessive
generalized myotonia and autosomal dominant myotonia congenita, while mutations in the kidney
channel CLC-5 lead to kidney stones (Jentsch, T.J. (1996) Curr. Opin. Neurobiol. 6:303-310).

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds
to the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter
binds to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or
muscle cells. There are two types of neurotransmitter-gated channels. Sodium channels open in
response to excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This
opening causes an influx of Na+ and produces the initial localized depolarization that activates the
voltage-gated channels and starts the action potential. Chloride channels open in response to
inhibitory neurotransmitters, such as γ-aminobutyric acid (GABA) and glycine, leading to
hyperpolarization of the membrane and the subsequent generation of an action potential.
Neurotransmitter-gated ion channels have four transmembrane domains and probably function as
pentamers (Jentsch, supra). Amino acids in the second transmembrane domain appear to be important
in determining channel permeation and selectivity (Sather, W.A. et al. (1994) Curr. Opin. Neurobiol.
4:313-323).

Ligand-gated channels can be regulated by intracellular second messengers. For example,
calcium-activated K+ channels are gated by internal calcium ions. In nerve cells, an influx of calcium
during depolarization opens K+ channels to modulate the magnitude of the action potential (Ishi et al.,
supra). The large conductance (BK) channel has been purified from brain and its subunit
composition determined. The α subunit of the BK channel has seven rather than six transmembrane
domains in contrast to voltage-gated K+ channels. The extra transmembrane domain is located at the
subunit N-terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the "calcium
bowl" region) contains many negatively charged residues and is thought to be the region responsible
for calcium binding. The β subunit consists of two transmembrane domains connected by a
glycosylated extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra; Vergara, C.
et al. (1998) Curr. Opin. Neurobiol. 8:321-329).

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best
examples of these are the cAMP-gated Na+ channels involved in olfaction and the cGMP-gated cation
channels involved in vision. Both systems involve ligand-mediated activation of a G-protein coupled
receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also represent
a major pathway for Ca2+ entry into neurons, and play roles in neuronal development and plasticity.
CNG channels are tetramers containing at least two types of subunits, an α subunit which can form
functional homomeric channels, and a β subunit, which modulates the channel properties. All CNG
subunits have six transmembrane domains and a pore forming region between the fifth and sixth
transmembrane domains, similar to voltage-gated K+ channels. A large C-terminal domain contains a
cyclic nucleotide binding domain, while the N-terminal domain confers variation among channel
subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412).

The activity of other types of ion channel proteins may also be modulated by a variety of
intracellular signalling proteins. Many channels have sites for phosphorylation by one or more
protein kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all
of which regulate ion channel activity in cells. Kir channels are activated by the binding of the Gβγ
subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell. Biol.
11:503-508). Other proteins are involved in the localization of ion channels to specific sites in the
cell membrane. Such proteins include the PDZ domain proteins known as MAGUKs (membrane-
associated guanylate kinases) which regulate the clustering of ion channels at neuronal synapses
(Craven, S.E. and D.S. Bredt (1998) Cell 93:495-498).
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters
and ion channels are associated with several disorders, e.g., cystic fibrosis, glucose-galactose
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across
membranes include, e.g., cystinuria, iminoglycinuria, Hartnup disease, and Fanconi disease (van't Hoff,
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226;
and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480).

Human diseases caused by mutations in ion channel genes include disorders of skeletal
muscle, cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of
sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary
contraction is delayed. Sodium channel myotonias have been treated with channel blockers.
Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while mutations
in the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle sodium channel
cause malignant hyperthermia. Cardiac arrhythmia disorders such as the long QT syndromes and
idiopathic ventricular fibrillation are caused by mutations in potassium and sodium channels (Cooper,
E.C. and L.Y. Jan (1998) Proc. Natl. Acad. Sci. USA 96:4759-4766). All four known human
idiopathic epilepsy genes code for ion channel proteins (Berkovic, S.F. and I.E. Scheffer (1999) Curr.
Opin. Neurology 12:177-182). Other neurological disorders such as ataxias, hemiplegic migraine and
hereditary deafness can also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin.
Neurobiol. 9:274-280; Cooper, supra).

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia.
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma,
and neurodegenerative disease (Taylor, C.P. and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98).
Various classes of ion channels also play an important role in the perception of pain, and thus are
potential targets for new analgesics. These include the vanilloid-gated ion channels, which are
activated by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as lidocaine
and mexiletine which blockade voltage-gated Na+ channels have been useful in the treatment of
neuropathic pain (Eglen, supra).

Ion channels in the immune system have recently been suggested as targets for
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell
specific ion channels has been characterized that affect this signaling process. Channel blocking
agents can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide
antagonist of the T-cell potassium channel Kv1.3 was found to suppress delayed-type hypersensitivity
and allogenic responses in pigs, validating the idea of channel blockers as safe and efficacious
immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 8:749-756).

The discovery of new transporters and ion channels, and the polynucleotides encoding them,
satisfies a need in the art by providing new compositions which are useful in the diagnosis,
prevention, and treatment of transport, neurological, muscle, immunological, and cell proliferative
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic
acid and amino acid sequences of transporters and ion channels.

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to
collectively as "TRICH" and individually as "TRICH-1," "TRICH-2," "TRICH-3," "TRICH-4,"
"TRICH-5," "TRICH-6," "TRICH-7," "TRICH-8," "TRICH-9," "TRICH-10," "TRICH-11,"
"TRICH-12," "TRICH-13," "TRICH-14," "TRICH-15," "TRICH-16," "TRICH-17," "TRICH-18,"
"TRICH-19," "TRICH-20," "TRICH-21," "TRICH-22," "TRICH-23," "TRICH-24," "TRICH-25,"
"TRICH-26," "TRICH-27," "TRICH-28," "TRICH-29," and "TRICH-30." In one aspect, the
invention provides an isolated polypeptide selected from the group consisting of a) a polypeptide
comprising an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, b) a
polypeptide comprising a naturally occurring amino acid sequence at least 90% identical to an amino
acid sequence selected from the group consisting of SEQ ID NO:1-30, c) a biologically active
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ
ID NO:1-30, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-30. In one alternative, the invention provides an
isolated polypeptide comprising the amino acid sequence of SEQ ID NO:1-30.
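
As an illustration of the "at least 90% identical" criterion recited above, the following is a minimal sketch of a percent-identity check. It assumes the two sequences have already been aligned by some external tool and that gap columns are excluded from the count; both the function names and the gap convention are assumptions made for this example and are not the method or parameters specified by the invention.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption for illustration: inputs are pre-aligned, equal-length strings
    that use '-' as the gap character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gap columns (one possible convention among several)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """Return True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Other conventions, such as dividing by the full alignment length or by the length of the shorter sequence, give different values for the same pair, so the convention should be stated whenever a threshold is applied.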

The invention further provides an isolated polynucleotide encoding a polypeptide selected
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-30, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical to an amino acid sequence selected from the group consisting of SEQ
ID NO:1-30, c) a biologically active fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-30, and d) an immunogenic fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30.
In one alternative, the polynucleotide encodes a polypeptide selected from the group consisting of
SEQ ID NO:1-30. In another alternative, the polynucleotide is selected from the group consisting of
SEQ ID NO:31-60.

Additionally, the invention provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-30, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, c) a
biologically active fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-30, and d) an immunogenic fragment of a polypeptide having an amino
acid sequence selected from the group consisting of SEQ ID NO:1-30. In one alternative, the
invention provides a cell transformed with the recombinant polynucleotide. In another alternative, the
invention provides a transgenic organism comprising the recombinant polynucleotide.

The invention also provides a method for producing a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-30, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, c) a
biologically active fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-30, and d) an immunogenic fragment of a polypeptide having an amino
acid sequence selected from the group consisting of SEQ ID NO:1-30. The method comprises a)
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a
polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed.

Additionally, the invention provides an isolated antibody which specifically binds to a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-30, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-30, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-30.

The invention further provides an isolated polynucleotide selected from the group consisting
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of
SEQ ID NO:31-60, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID
NO:31-60, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). In one alternative, the
polynucleotide comprises at least 60 contiguous nucleotides.
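
The relationship between a polynucleotide, its complement, and its RNA equivalent can be sketched as follows. This is a minimal illustration that assumes unambiguous DNA written 5' to 3' with only the letters A, C, G, and T; the function names are hypothetical and the complement is returned in 5' to 3' orientation (i.e., as the reverse complement).

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence (thymine replaced by uracil)."""
    return dna.replace("T", "U").replace("t", "u")


if __name__ == "__main__":
    seq = "ATGGCC"
    print(reverse_complement(seq))  # GGCCAT
    print(rna_equivalent(seq))      # AUGGCC
```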

Additionally, the invention provides a method for detecting a target polynucleotide in a
sample, said target polynucleotide having a sequence of a polynucleotide selected from the group
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group
consisting of SEQ ID NO:31-60, b) a polynucleotide comprising a naturally occurring polynucleotide
sequence at least 90% identical to a polynucleotide sequence selected from the group consisting of
SEQ ID NO:31-60, c) a polynucleotide complementary to the polynucleotide of a), d) a
polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The
method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and
which probe specifically hybridizes to said target polynucleotide, under conditions whereby a
hybridization complex is formed between said probe and said target polynucleotide or fragments
thereof, and b) detecting the presence or absence of said hybridization complex, and optionally, if
present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous
nucleotides.
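
The probe requirement in the method above (at least 20 contiguous nucleotides complementary to the target) can be illustrated in silico. The sketch below is a deliberately simplified model that treats hybridization as exact, full-length Watson-Crick complementarity between a DNA probe and a DNA target; real hybridization depends on stringency conditions and tolerates mismatches, which this example does not attempt to model. The function names and the default minimum length parameter are assumptions made for illustration.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand of an uppercase DNA sequence, 5' to 3'."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def is_candidate_probe(probe: str, target: str, min_length: int = 20) -> bool:
    """Check the two conditions of this simplified model:

    1. the probe has at least `min_length` nucleotides, and
    2. the probe is exactly complementary to some contiguous stretch of the target.
    """
    probe = probe.upper()
    target = target.upper()
    if len(probe) < min_length:
        return False
    # A probe complementary to the target has a reverse complement
    # that occurs as a substring of the target.
    return reverse_complement(probe) in target


if __name__ == "__main__":
    # Hypothetical target sequence, used only for this example.
    target = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
    probe = reverse_complement(target[5:30])  # 25-nucleotide complementary probe
    print(is_candidate_probe(probe, target))       # long enough and complementary
    print(is_candidate_probe(probe[:10], target))  # too short for the model
```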

The invention further provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of
SEQ ID NO:31-60, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID
NO:31-60, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain
reaction amplification, and b) detecting the presence or absence of said amplified target
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof.

The invention further provides a composition comprising an effective amount of a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-30, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-30, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-30, and a pharmaceutically acceptable excipient. In one embodiment, the
composition comprises an amino acid sequence selected from the group consisting of SEQ ID
NO:1-30. The invention additionally provides a method of treating a disease or condition associated with
decreased expression of functional TRICH, comprising administering to a patient in need of such
treatment the composition.

The invention also provides a method for screening a compound for effectiveness as an
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-30, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-30, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-30. The method comprises a) exposing a sample comprising the
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the
invention provides a composition comprising an agonist compound identified by the method and a
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of
treating a disease or condition associated with decreased expression of functional TRICH, comprising
administering to a patient in need of such treatment the composition.

Additionally, the invention provides a method for screening a compound for effectiveness as
an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an
amino acid sequence selected from the group consisting of SEQ ID NO:1-30, b) a polypeptide
comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid
sequence selected from the group consisting of SEQ ID NO:1-30, c) a biologically active fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30,
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-30. The method comprises a) exposing a sample comprising the
polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the
invention provides a composition comprising an antagonist compound identified by the method and a
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of
treating a disease or condition associated with overexpression of functional TRICH, comprising
administering to a patient in need of such treatment the composition.

The invention further provides a method of screening for a compound that specifically binds
to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-30, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-30, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-30. The method comprises a) combining the polypeptide with at least
one test compound under suitable conditions, and b) detecting binding of the polypeptide to the test
compound, thereby identifying a compound that specifically binds to the polypeptide.

The invention further provides a method of screening for a compound that modulates the
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-30, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-30, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-30. The method comprises a) combining the polypeptide with at least
one test compound under conditions permissive for the activity of the polypeptide, b) assessing the
activity of the polypeptide in the presence of the test compound, and c) comparing the activity of the
polypeptide in the presence of the test compound with the activity of the polypeptide in the absence
of the test compound, wherein a change in the activity of the polypeptide in the presence of the test
compound is indicative of a compound that modulates the activity of the polypeptide.

The invention further provides a method for screening a compound for effectiveness in
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a
polynucleotide sequence selected from the group consisting of SEQ ID NO:31-60, the method
comprising a) exposing a sample comprising the target polynucleotide to a compound, and b)
detecting altered expression of the target polynucleotide.

The invention further provides a method for assessing toxicity of a test compound, said
method comprising a) treating a biological sample containing nucleic acids with the test compound;
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:31-60, ii) a
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:31-60, iii) a
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions
whereby a specific hybridization complex is formed between said probe and a target polynucleotide
in the biological sample, said target polynucleotide selected from the group consisting of i) a
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NO:31-60, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:31-60,
iii) a polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide complementary
to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the target
polynucleotide comprises a fragment of a polynucleotide sequence selected from the group consisting
of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing the amount of
hybridization complex in the treated biological sample with the amount of hybridization complex in
an untreated biological sample, wherein a difference in the amount of hybridization complex in the
treated biological sample is indicative of toxicity of the test compound.
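
Steps c) and d) of the toxicity method above amount to comparing a quantified hybridization signal between treated and untreated samples. The following minimal sketch shows one way such a comparison might be expressed. The use of a fold-change ratio and the two-fold cutoff are assumptions chosen purely for illustration; the text above does not specify how large a difference must be, and the function names are hypothetical.

```python
def fold_change(treated_signal: float, untreated_signal: float) -> float:
    """Ratio of the treated to the untreated hybridization signal."""
    if treated_signal <= 0 or untreated_signal <= 0:
        raise ValueError("signals must be positive")
    return treated_signal / untreated_signal


def signals_differ(treated_signal: float, untreated_signal: float,
                   min_fold: float = 2.0) -> bool:
    """Flag a difference when the signal rises or falls by at least `min_fold`.

    The default two-fold cutoff is an illustrative assumption, not a value
    taken from the source text.
    """
    ratio = fold_change(treated_signal, untreated_signal)
    return ratio >= min_fold or ratio <= 1.0 / min_fold
```

A symmetric cutoff is used so that both increased and decreased amounts of hybridization complex are flagged, since the method refers only to "a difference" without specifying its direction.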

BRIEF DESCRIPTION OF THE TABLES

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide
sequences of the present invention.

Table 2 shows the GenBank identification number and annotation of the nearest GenBank
homolog for polypeptides of the invention. The probability score for the match between each
polypeptide and its GenBank homolog is also shown.

Table 3 shows structural features of polypeptide sequences of the invention, including
predicted motifs and domains, along with the methods, algorithms, and searchable databases used for
analysis of the polypeptides.

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide
sequences.

Table 5 shows the representative cDNA library for polynucleotides of the invention.

Table 6 provides an appendix which describes the tissues and vectors used for construction of
the cDNA libraries shown in Table 5.

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and
polypeptides of the invention, along with applicable descriptions, references, and threshold
parameters.

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood
that this invention is not limited to the particular machines, materials and methods described, as these
may vary. It is also to be understood that the terminology used herein is for the purpose of describing
particular embodiments only, and is not intended to limit the scope of the present invention which
will be limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so
forth.

Unless defined otherwise, all technical and scientific terms used herein have the same
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs.
Although any machines, materials, and methods similar or equivalent to those described herein can be
used to practice or test the present invention, the preferred machines, materials and methods are now
described. All publications mentioned herein are cited for the purpose of describing and disclosing
the cell lines, protocols, reagents and vectors which are reported in the publications and which might
be used in connection with the invention. Nothing herein is to be construed as an admission that the
invention is not entitled to antedate such disclosure by virtue of prior invention.

DEFINITIONS

"TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of
TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other
compound or composition which modulates the activity of TRICH either by directly interacting with
TRICH or by acting on components of the biological pathway in which TRICH participates.

An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants may
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or
many allelic variants of its naturally occurring form. Common mutational changes which give rise to
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times
in a given sequence.

25 "Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 

insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or a 
polypeptide with at least one functional characteristic of TRICH. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic variants, 

30 with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
TRICH. The encoded protein may also be "altered," and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent TRICH. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 

35 residues, as long as the biological or immunological activity of TRICH is retained. For example, 
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negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine.

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art.

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of TRICH. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small
molecules, or any other compound or composition which modulates the activity of TRICH either by 
directly interacting with TRICH or by acting on components of the biological pathway in which 
TRICH participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or using 
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin,
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA;
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified
sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense
molecules may be produced by any method including chemical synthesis or transcription. Once
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TRICH, or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement,
3'-TCA-5'.
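
The pairing rule in this definition can be illustrated with a short Python sketch. The helper names `complement` and `reverse_complement` are illustrative and do not come from the specification, and only the unambiguous DNA bases A, C, G, and T are handled.

```python
# Watson-Crick pairing for unambiguous DNA bases (illustrative helper names).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq_5_to_3: str) -> str:
    """Return the pairing strand, written base-by-base in the 3'->5' direction."""
    return "".join(PAIRS[base] for base in seq_5_to_3.upper())


def reverse_complement(seq_5_to_3: str) -> str:
    """Return the same pairing strand written in the conventional 5'->3' direction."""
    return complement(seq_5_to_3)[::-1]


if __name__ == "__main__":
    print(complement("AGT"))          # pairing strand, read 3'->5'
    print(reverse_complement("AGT"))  # pairing strand, read 5'->3'
```

Writing the complement in the 3'->5' direction mirrors the example in the definition; the reversed form is the one most sequence files use.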

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide
or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding TRICH or fragments of TRICH may be
employed as hybridization probes. The probes may be stored in freeze-dried form and may be
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl
sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer
program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison
WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both extended and
assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of
the protein is conserved and not significantly changed by such substitutions. The table below shows
amino acids which may be substituted for an original amino acid in a protein and which are regarded
as conservative amino acid substitutions.





Original Residue    Conservative Substitution
Arg
Leu
Lys                 Arg, Gln, Glu
Met                 Leu, Ile
Phe                 His, Met, Leu, Trp, Tyr
Ser                 Cys, Thr
Thr                 Ser, Val
Trp                 Phe, Tyr
Tyr                 His, Phe, Trp
Val                 Ile, Leu, Thr



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation,
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 
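
As a minimal sketch, the substitution table can be treated as a lookup keyed by the original residue. Only the rows whose substitutions are legible in the table above are included, and the names `CONSERVATIVE` and `is_conservative` are illustrative assumptions rather than anything defined in the specification.

```python
# Rows copied from the table above (three-letter residue codes).
# Rows without a legible substitution list are deliberately left out.
CONSERVATIVE = {
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if `replacement` is listed as a conservative substitution for `original`."""
    return replacement in CONSERVATIVE.get(original, set())


if __name__ == "__main__":
    print(is_conservative("Lys", "Arg"))
    print(is_conservative("Ser", "Trp"))
```

The lookup is directional, matching the table: a residue is checked only against the list given for the original residue.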

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides.

The term "derivative" refers to a chemically modified polynucleotide or polypeptide.
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from which it was derived.

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a
diseased and a normal sample.

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an
exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions.

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10,
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present
embodiments. 
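
The length and contiguity conditions in this definition can be expressed as a small check. This is a sketch under stated assumptions: the function name `is_fragment`, the default minimum length of 5 (taken from the shortest exemplary length above), and the example sequence are all hypothetical, and the check does not test whether a fragment is unique within a genome.

```python
def is_fragment(candidate: str, parent: str, min_len: int = 5) -> bool:
    """Check that `candidate` is a contiguous stretch of `parent` and shorter than it.

    The upper bound follows the definition: at most the full length minus one residue.
    """
    return min_len <= len(candidate) < len(parent) and candidate in parent


if __name__ == "__main__":
    parent = "MKTLLVAGGS"  # hypothetical 10-residue sequence, for illustration only
    print(is_fragment("TLLVA", parent))
    print(is_fragment(parent, parent))
```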

A fragment of SEQ ID NO:31-60 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:31-60, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:31-60 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish
SEQ ID NO:31-60 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:31-60 and the region of SEQ ID NO:31-60 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO:1-30 is encoded by a fragment of SEQ ID NO:31-60. A fragment 
of SEQ ID NO:1-30 comprises a region of unique amino acid sequence that specifically identifies
SEQ ID NO:1-30. For example, a fragment of SEQ ID NO:1-30 is useful as an immunogenic peptide
for the development of antibodies that specifically recognize SEQ ID NO:1-30. The precise length of
a fragment of SEQ ID NO:1-30 and the region of SEQ ID NO:1-30 to which the fragment
corresponds are routinely determinable by one of ordinary skill in the art based on the intended
purpose for the fragment.

A "full length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
"full length" polynucleotide sequence encodes a "full length" polypeptide sequence. 
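
The three elements named in this definition (initiation codon, open reading frame, termination codon) can be checked with a short scan. This is a minimal sketch that assumes a DNA-alphabet coding strand, ATG as the initiation codon, and the standard stop codons; the function name `has_full_length_orf` is illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def has_full_length_orf(dna: str) -> bool:
    """True if some ATG is followed, in the same reading frame, by a stop codon."""
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Walk codon by codon from the position just after the ATG.
        for i in range(start + 3, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                return True
        start = dna.find("ATG", start + 1)
    return False


if __name__ == "__main__":
    print(has_full_length_orf("GGATGAAATTTTAACC"))
    print(has_full_length_orf("ATGAAATTT"))
```

The scan only confirms that the elements are present; it does not judge whether the reading frame found is the biologically relevant one.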

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between
two or more polynucleotide sequences or two or more polypeptide sequences.

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences.
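
As a simplified illustration of "percentage of residue matches," the sketch below scores two sequences that have already been aligned, with `-` marking gaps. It assumes the denominator is the number of alignment columns; CLUSTAL V and BLAST each apply their own conventions, so this is not a reimplementation of either program, and the function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    print(percent_identity("ACGT-A", "ACCTGA"))
```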

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e
sequence alignment program. This program is part of the LASERGENE software package, a suite of
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
similarity" between aligned polynucleotide sequences.

Alternatively, a suite of commonly used and freely available sequence comparison algorithms
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available
from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html.
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62
Reward for match: 1
Penalty for mismatch: -2
Open Gap: 5 and Extension Gap: 2 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 11
Filter: on

Percent identity may be measured over the length of an entire defined sequence, for example,
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length
supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid
sequences that all encode substantially the same protein. 
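
The effect of degeneracy can be seen with a deliberately small example. The sketch below uses only the leucine and serine codons of the standard genetic code; the two six-nucleotide sequences are hypothetical and chosen so that they share few positions yet encode the same dipeptide.

```python
# Partial codon table: the six Leu and six Ser codons of the standard genetic code.
CODONS = {
    **dict.fromkeys(["TTA", "TTG", "CTT", "CTC", "CTA", "CTG"], "Leu"),
    **dict.fromkeys(["TCT", "TCC", "TCA", "TCG", "AGT", "AGC"], "Ser"),
}


def translate(dna: str) -> list[str]:
    """Translate complete codons using the partial table above."""
    usable = len(dna) - len(dna) % 3
    return [CODONS[dna[i:i + 3]] for i in range(0, usable, 3)]


def shared_positions(a: str, b: str) -> int:
    """Count positions at which two equal-length sequences carry the same base."""
    return sum(x == y for x, y in zip(a, b))


if __name__ == "__main__":
    seq_a, seq_b = "TTAAGT", "CTGTCC"  # hypothetical sequences
    print(translate(seq_a), translate(seq_b))
    print(shared_positions(seq_a, seq_b), "of", len(seq_a), "bases identical")
```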

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative
substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e
sequence alignment program (described and referenced above). For pairwise alignments of
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs.

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62
Open Gap: 11 and Extension Gap: 1 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 3
Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence,
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a
complementary strand through base pairing under defined hybridization conditions. Specific
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity.
Specific hybridization complexes form under permissive annealing conditions and remain hybridized
after the "washing" step(s). The washing step(s) is particularly important in determining the
stringency of the hybridization process, with more stringent conditions allowing less non-specific
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill
in the art and may be consistent among hybridization experiments, whereas wash conditions may be
varied among experiments to achieve the desired stringency, and therefore hybridization specificity.
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about
1% (w/v) SDS, and about 100 µg/ml sheared, denatured salmon sperm DNA.

Generally, stringency of hybridization is expressed, in part, with reference to the temperature
under which the wash step is carried out. Such wash temperatures are typically selected to be about
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of
the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al.
(1989) Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Press,
Plainview NY; specifically see volume 2, chapter 9.
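
The specification cites a reference for the melting-point equation rather than stating one. As a hedged illustration only, the sketch below uses one widely quoted empirical approximation for DNA duplexes, Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%G+C) - 600/N, where N is the duplex length in nucleotides; it is an assumption that this form is representative of what the cited reference provides. The function names are hypothetical, and the wash window simply applies the "5°C to 20°C lower than Tm" guidance above.

```python
import math


def melting_temperature(seq: str, sodium_molar: float) -> float:
    """Approximate Tm in degrees C (assumed empirical formula; see lead-in)."""
    seq = seq.upper()
    gc_percent = 100.0 * sum(seq.count(base) for base in "GC") / len(seq)
    return (
        81.5
        + 16.6 * math.log10(sodium_molar)
        + 0.41 * gc_percent
        - 600.0 / len(seq)
    )


def wash_temperature_range(tm: float) -> tuple[float, float]:
    """Return (lowest, highest) wash temperature, i.e. 20 to 5 degrees C below Tm."""
    return (tm - 20.0, tm - 5.0)


if __name__ == "__main__":
    probe = "ACGT" * 10  # hypothetical 40-nucleotide probe, 50% G+C
    tm = melting_temperature(probe, sodium_molar=1.0)
    print(round(tm, 1), wash_temperature_range(tm))
```

Lower sodium concentrations reduce the computed Tm, which is consistent with low-salt washes being the more stringent ones.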

High stringency conditions for hybridization between polynucleotides of the present
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS,
for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%.
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 µg/ml. Organic
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular
circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions
will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such
similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides.

The term "hybridization complex" refers to a complex formed between two nucleic acid
sequences by virtue of the formation of hydrogen bonds between complementary bases. A
hybridization complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect
cellular and systemic defense systems.

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of TRICH which is useful in any of the antibody production methods disclosed herein or known in the
art.

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray.

The term "modulate" refers to a change in the activity of TRICH. For example, modulation
may cause an increase or a decrease in protein activity, binding characteristics, or any other
biological, functional, or immunological properties of TRICH.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition.
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TRICH may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in
the art. These processes may occur synthetically or biochemically. Biochemical modifications will
vary by cell type depending on the enzymatic milieu of TRICH.

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes.
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular
Biology, Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR
Protocols, A Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs
can be derived from a known sequence, for example, by using computer programs intended for that
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3
primer selection program (available to the public from the Whitehead Institute/MIT Center for
Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the
selection of oligonucleotides for microarrays. (The source code for the latter two primer selection
programs may also be obtained from their respective sources and modified to meet the user's specific
needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping 
Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both
unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and
polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal.

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions
(UTRs). Regulatory elements interact with host or viral proteins which control transcription,
translation, or RNA stability.

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and
other moieties known in the art.

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 
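
At the level of the written sequence, this definition amounts to replacing every thymine with uracil. A minimal sketch follows; the function name `rna_equivalent` is illustrative, and the change of sugar backbone is a chemical property that a sequence string does not represent.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence: same linear order, T replaced by U."""
    return dna.upper().replace("T", "U")


if __name__ == "__main__":
    print(rna_equivalent("ATGCTT"))
```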

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH,
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or
synthetic binding composition. The interaction is dependent upon the presence of a particular
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides
by different amino acid residues or nucleotides, respectively.

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell
type or tissue under given conditions at a given time.

"Transformation" describes a process by which exogenous DNA is introduced into a recipient
cell. Transformation may occur under natural or artificial conditions according to various methods
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term
"transformed cells" includes stably transformed cells in which the inserted DNA is capable of
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well
as transiently transformed cells which express the inserted DNA or RNA for limited periods of time.

A "transgenic organism," as used herein, is any organism, including but not limited to
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic
acid introduced by way of human intervention, such as by transgenic techniques well known in the
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The
transgenic organisms contemplated in accordance with the present invention include bacteria,
cyanobacteria, fungi, plants and animals. The isolated DNA of the present invention can be
introduced into the host by methods known in the art, for example infection, transfection,
transformation or transconjugation. Techniques for transferring the DNA of the present invention
into such organisms are widely known and provided in references such as Sambrook et al. (1989),
supra.

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater
sequence identity over a certain defined length. A variant may be described as, for example, an
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have
significant identity to a reference molecule, but will generally have a greater or lesser number of
polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding
polypeptide may possess additional functional domains or lack domains that are present in the
reference molecule. Species variants are polynucleotide sequences that vary from one species to
another. The resulting polypeptides will generally have significant amino acid identity relative to
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a
propensity for a disease state.

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at
least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 
94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity over a certain defined length of one of the polypeptides. 
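
The percent-identity thresholds in the two "variant" definitions can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned (gaps written as "-") and simply counts identical columns; it does not reproduce the BLAST 2 Sequences tool named above, and the function names `percent_identity` and `is_variant` are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the aligned length of two pre-aligned sequences.

    Both strings must have the same length; '-' marks a gap. Columns in
    which both sequences carry a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("alignment contains no comparable columns")
    # A column counts as a match only if both residues are identical non-gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def is_variant(aligned_a: str, aligned_b: str, threshold: float = 40.0) -> bool:
    """Apply the 'at least 40% sequence identity' floor from the definition."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    print(round(percent_identity("ACGT-A", "ACCTTA"), 1))
    print(is_variant("ACGT-A", "ACCTTA"))
```

Raising `threshold` to 50.0, 60.0, and so on corresponds to the progressively stricter identity levels listed in the definitions.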

THE INVENTION

The invention is based on the discovery of new human transporters and ion channels 
(TRICH), the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, 
treatment, or prevention of transport, neurological, muscle, immunological, and cell proliferative 
disorders. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide
sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a 
single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is denoted 
by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte 
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide sequence is
denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID NO:) and
an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte
polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. Column 3
shows the GenBank identification number (Genbank ID NO:) of the nearest GenBank homolog. 
Column 4 shows the probability score for the match between each polypeptide and its GenBank 
homolog. Column 5 shows the annotation of the GenBank homolog along with relevant citations 
where applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1
and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention.
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the
MOTIFS program of the GCG sequence analysis software package (Genetics Computer Group,
Madison WI). Column 6 shows amino acid residues comprising signature sequences, domains, and 
motifs. Column 7 shows analytical methods for protein structure/function analysis and in some cases, 
searchable databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these
properties establish that the claimed polypeptides are transporters and ion channels. For example,
SEQ ID NO:6 is 89% identical to rat neuronal nicotinic acetylcholine receptor subunit (GenBank ID
g6746563) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The
BLAST probability score is 1.7e-188, which indicates the probability of obtaining the observed
polypeptide sequence alignment by chance. SEQ ID NO:6 also contains a neurotransmitter-gated ion
channel domain as determined by searching for statistically significant matches in the hidden Markov
model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from
BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ
ID NO:6 is a neurotransmitter-gated ion channel. In an alternative example, SEQ ID NO:14 is 93%
identical to rat TAP-like ABC transporter (GenBank ID g6045150) as determined by the Basic Local
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, which
indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ
ID NO:14 also contains an ABC transporter domain and an ABC transporter transmembrane region as
determined by searching for statistically significant matches in the hidden Markov model (HMM)-
based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS,
MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:14 is
an ABC transporter. In an alternative example, SEQ ID NO:16 is 98% identical to human voltage-
dependent anion channel (GenBank ID g340199) as determined by the Basic Local Alignment Search
Tool (BLAST). (See Table 2.) The BLAST probability score is 1.2e-130, which indicates the
probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO:16
also contains a eukaryotic porin active site domain as determined by searching for statistically
significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein
family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide
further corroborative evidence that SEQ ID NO:16 is a mitochondrial porin. In an alternative
example, SEQ ID NO:20 is 28% identical to a rat voltage-gated calcium channel (GenBank ID
g4586963) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The
BLAST probability score is 2.4e-27, which indicates the probability of obtaining the observed
polypeptide sequence alignment by chance. Data from BLIMPS and BLAST analyses provide further
corroborative evidence that SEQ ID NO:20 is a voltage-gated calcium channel. In an alternative
example, SEQ ID NO:22 is 82% identical to human inhibitory glycine receptor (GenBank ID g31849)
as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST
probability score is 1.1e-175, which indicates the probability of obtaining the observed polypeptide
sequence alignment by chance. SEQ ID NO:22 also contains a neurotransmitter-gated ion channel
domain as determined by searching for statistically significant matches in the hidden Markov model
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from
BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ
ID NO:22 is a neurotransmitter-gated ion channel. In an alternative example, SEQ ID NO:30 is 36%
identical to human ATP binding cassette (ABC)-C transporter (GenBank ID g1514530) as
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST
probability score is 2.3e-127, which indicates the probability of obtaining the observed polypeptide
sequence alignment by chance. SEQ ID NO:30 also contains ABC transporter domains as determined
by searching for statistically significant matches in the hidden Markov model (HMM)-based PFAM
database of conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and
PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:30 is an ABC
transporter. SEQ ID NO:1-5, SEQ ID NO:7-13, SEQ ID NO:15, SEQ ID NO:17-19, SEQ ID NO:21,
and SEQ ID NO:23-29 were analyzed and annotated in a similar manner. The algorithms and
parameters for the analysis of SEQ ID NO:1-30 are described in Table 7.

As shown in Table 4, the full length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 
combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention. 
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments 
of the polynucleotide sequences which are useful, for example, in hybridization or amplification 
technologies that identify SEQ ID NO:31-60 or that distinguish between SEQ ID NO:31-60 and
related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages 
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5') 
and stop (3') positions of the cDNA and/or genomic sequences in column 5 relative to their respective
full length sequences.

The identification numbers in Column 5 of Table 4 may refer specifically, for example, to
Incyte cDNAs along with their corresponding cDNA libraries. For example, 6340750H1 is the
identification number of an Incyte cDNA sequence, and BRANDIN01 is the cDNA library from
which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from
pooled cDNA libraries (e.g., 71911330V1). Alternatively, the identification numbers in column 5
may refer to GenBank cDNAs or ESTs (e.g., g5110579) which contributed to the assembly of the full
length polynucleotide sequences. In addition, the identification numbers in column 5 may identify
sequences derived from the ENSEMBL (The Sanger Centre, Cambridge, UK) database (i.e., those
sequences including the designation "ENST"). Alternatively, the identification numbers in column 5
may be derived from the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences
including the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., those
sequences including the designation "NP"). Alternatively, the identification numbers in column 5
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon
stitching" algorithm. For example, FL_XXXXXX_N1_N2_YYYYY_N3_N4 represents a "stitched"
sequence in which XXXXXX is the identification number of the cluster of sequences to which the
algorithm was applied, and YYYYY is the number of the prediction generated by the algorithm, and
N1,2,3..., if present, represent specific exons that may have been manually edited during analysis (See
Example V). Alternatively, the identification numbers in column 5 may refer to assemblages of
exons brought together by an "exon-stretching" algorithm. For example,
FLXXXXXX_gAAAAA_gBBBBB_1_N is the identification number of a "stretched" sequence, with
XXXXXX being the Incyte project identification number, gAAAAA being the GenBank identification
number of the human genomic sequence to which the "exon-stretching" algorithm was applied,
gBBBBB being the GenBank identification number or NCBI RefSeq identification number of the
nearest GenBank protein homolog, and N referring to specific exons (See Example V). In instances
where a RefSeq sequence was used as a protein homolog for the "exon-stretching" algorithm, a
RefSeq identifier (denoted by "NM," "NP," or "NT") may be used in place of the GenBank identifier
(i.e., gBBBBB).

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following table lists examples of component sequence prefixes and corresponding sequence analysis
methods associated with the prefixes (see Example IV and Example V). 



Prefix           Type of analysis and/or examples of programs

GNN, GFG, ENST   Exon prediction from genomic sequences using, for example,
                 GENSCAN (Stanford University, CA, USA) or FGENES
                 (Computer Genomics Group, The Sanger Centre, Cambridge, UK).

GBI              Hand-edited analysis of genomic sequences.

FL               Stitched or stretched genomic sequences (see Example V).

INCY             Full length transcript and exon prediction from mapping of EST
                 sequences to the genome. Genomic location and EST composition
                 data are combined to predict the exons and resulting transcript.


In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
column 5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant 
Incyte cDNA identification numbers are not shown. 
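
The identifier conventions described above for column 5 of Table 4 can be sketched as a small lookup. This is an illustration only: the function name `classify_identifier` is hypothetical, the prefix descriptions are paraphrased from the prefix table, and the two regular expressions for Incyte cDNA and GenBank identifiers are assumptions inferred from the examples quoted in the text (6340750H1, 71911330V1, g5110579), not a stated format.

```python
import re

# Prefix descriptions paraphrased from the prefix table; checked in order.
PREFIX_RULES = [
    ("ENST", "ENSEMBL-derived sequence (exon prediction from genomic sequence)"),
    ("GNN", "exon prediction from genomic sequence"),
    ("GFG", "exon prediction from genomic sequence"),
    ("GBI", "hand-edited analysis of genomic sequence"),
    ("INCY", "transcript and exon prediction from EST-to-genome mapping"),
    ("FL", "stitched or stretched genomic sequence"),
    ("NM", "NCBI RefSeq nucleotide record"),
    ("NT", "NCBI RefSeq nucleotide record"),
    ("NP", "NCBI RefSeq protein record"),
]

# Assumed shapes, inferred only from the examples given in the text.
GENBANK_PATTERN = re.compile(r"g\d+")
INCYTE_CDNA_PATTERN = re.compile(r"\d+[A-Z]\d+")


def classify_identifier(identifier: str) -> str:
    """Return a short description of what a column 5 identifier refers to."""
    for prefix, description in PREFIX_RULES:
        if identifier.startswith(prefix):
            return description
    if GENBANK_PATTERN.fullmatch(identifier):
        return "GenBank cDNA or EST"
    if INCYTE_CDNA_PATTERN.fullmatch(identifier):
        return "Incyte cDNA"
    return "unrecognized"


if __name__ == "__main__":
    for example in ("6340750H1", "71911330V1", "g5110579"):
        print(example, "->", classify_identifier(example))
```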

Table 5 shows the representative cDNA libraries for those full length polynucleotide
sequences which were assembled using Incyte cDNA sequences. The representative cDNA library is 
the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences which 
were used to assemble and confirm the above polynucleotide sequences. The tissues and vectors 
which were used to construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses TRICH variants. A preferred TRICH variant is one which
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TRICH amino acid sequence, and which contains at least one functional or 
structural characteristic of TRICH. 

The invention also encompasses polynucleotides which encode TRICH. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected
from the group consisting of SEQ ID NO:31-60, which encodes TRICH. The polynucleotide
sequences of SEQ ID NO:31-60, as presented in the Sequence Listing, embrace the equivalent RNA
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. In
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide
sequence encoding TRICH. A particular aspect of the invention encompasses a variant of a
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID
NO:31-60 which has at least about 70%, or alternatively at least about 85%, or even at least about
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting
of SEQ ID NO:31-60. Any one of the polynucleotide variants described above can encode an amino
acid sequence which contains at least one functional or structural characteristic of TRICH. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be
produced. Thus, the invention contemplates each and every possible variation of polynucleotide
sequence that could be made by selecting combinations based on possible codon choices. These
combinations are made in accordance with the standard triplet genetic code as applied to the
polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered
as being specifically disclosed.
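
The size of the "multitude" of coding sequences follows from simple arithmetic: under the standard genetic code, the number of distinct polynucleotides encoding a given peptide is the product of the number of synonymous codons at each position. A minimal sketch, using a hypothetical function name `count_encodings` and the codon counts of the standard code:

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code
# (one-letter amino acid codes; the 61 sense codons in total).
SYNONYMOUS_CODONS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}
STOP_CODONS = 3  # TAA, TAG, TGA


def count_encodings(peptide: str, include_stop: bool = False) -> int:
    """Count the distinct coding sequences that encode the given peptide."""
    total = prod(SYNONYMOUS_CODONS[residue] for residue in peptide.upper())
    return total * STOP_CODONS if include_stop else total


if __name__ == "__main__":
    print(count_encodings("MLS"))
    print(count_encodings("MLS", include_stop=True))
```

Even a three-residue peptide such as Met-Leu-Ser has 36 possible coding sequences, so the count grows very quickly with the length of the polypeptide.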

Although nucleotide sequences which encode TRICH and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TRICH 
or its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally
occurring codons. Codons may be selected to increase the rate at which expression of the peptide
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding TRICH and its derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a greater
half-life, than transcripts produced from the naturally occurring sequence.
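
Altering codon usage without altering the encoded amino acid sequence can be sketched as follows. The sketch builds the standard codon table, then swaps each codon for a host-preferred synonymous codon where one is supplied. The `PREFERRED` mapping is a hypothetical placeholder and not measured codon-usage data for any host; the function names are likewise illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical host preferences (amino acid -> preferred codon); placeholder only.
PREFERRED = {"L": "CTG", "S": "AGC", "K": "AAA"}


def translate(cds: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds: str, preferred: dict) -> str:
    """Replace codons with preferred synonymous codons, keeping the protein."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    recoded = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        # Keep the original codon when no preference is given for its residue.
        recoded.append(preferred.get(CODON_TABLE[codon], codon))
    result = "".join(recoded)
    # Guard: a preference table with a non-synonymous codon would fail here.
    if translate(result) != translate(cds):
        raise ValueError("preferred codons changed the encoded sequence")
    return result


if __name__ == "__main__":
    original = "ATGTTATCTAAGTAA"
    print(recode(original, PREFERRED), translate(original))
```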

The invention also encompasses production of DNA sequences which encode TRICH and 
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to
introduce mutations into a sequence encoding TRICH or any fragment thereof.

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:31-60 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV),
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler
(Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA
sequencing system (Applied Biosystems), the MEGABACE 1000 DNA sequencing system
(Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting sequences
are analyzed using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M.
(1997) Short Protocols in Molecular Biology, John Wiley & Sons, New York NY, unit 7.7; Meyers,
R.A. (1995) Molecular Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853.)

The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.)
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown
sequence from a circularized template. The template is derived from restriction fragments comprising
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments
adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom,
M. et al. (1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme
digestions and ligations may be used to insert an engineered double-stranded sequence into a region
of unknown sequence before performing PCR. Other methods which may be used to retrieve
unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res.
19:3055-3060.) Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries
(Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries
and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed
using commercially available software, such as OLIGO 4.06 primer analysis software (National
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in
length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of
about 68°C to 72°C.
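
The primer criteria stated above (about 22 to 30 nucleotides, GC content of about 50% or more, annealing at about 68°C to 72°C) can be checked with a minimal sketch. This is not the OLIGO 4.06 algorithm. As an assumption, the temperature criterion is approximated by a common rule-of-thumb melting temperature estimate, Tm = 64.9 + 41 x (GC count - 16.4) / length, which is only a rough stand-in for the annealing temperature; the function names are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def estimate_tm(primer: str) -> float:
    """Rough melting temperature estimate in degrees C (rule of thumb only)."""
    primer = primer.upper()
    gc_count = primer.count("G") + primer.count("C")
    return 64.9 + 41.0 * (gc_count - 16.4) / len(primer)


def check_primer(primer: str) -> dict:
    """Evaluate a primer against the three criteria stated in the text."""
    tm = estimate_tm(primer)
    return {
        "length_ok": 22 <= len(primer) <= 30,
        "gc_ok": gc_fraction(primer) >= 0.50,
        # Assumption: the Tm estimate is used as a proxy for annealing temperature.
        "tm_ok": 68.0 <= tm <= 72.0,
        "tm_estimate": round(tm, 1),
    }


if __name__ == "__main__":
    # Artificial test sequence, not a real primer.
    print(check_primer("G" * 19 + "A" * 11))
```

In practice a dedicated primer-analysis program would also screen for hairpins and primer dimers, which this sketch does not attempt.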

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence
into 5' non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide-
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments
which may be present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences or fragments thereof
which encode TRICH may be cloned in recombinant DNA molecules that direct expression of
TRICH, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a
functionally equivalent amino acid sequence may be produced and used to express TRICH.

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TRICH-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide-
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or
improve the biological properties of TRICH, such as its biological or enzymatic activity or its ability
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene
variants is produced using PCR-mediated recombination of gene fragments. The library is then
subjected to selection or screening procedures that identify those gene variants with the desired
properties. These preferred variants may then be pooled and further subjected to recursive rounds of
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial"
breeding and rapid molecular evolution. For example, fragments of a single gene containing random
point mutations may be recombined, screened, and then reshuffled until the desired properties are
optimized. Alternatively, fragments of a given gene may be recombined with fragments of
homologous genes in the same gene family, either from the same or different species, thereby
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.)
Alternatively, TRICH itself or a fragment thereof may be synthesized using chemical methods. For
example, peptide synthesis can be performed using various solution-phase or solid-phase techniques.
(See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New
York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis
may be achieved using the ABI 431A peptide synthesizer (Applied Biosystems). Additionally, the
amino acid sequence of TRICH, or any part thereof, may be altered during direct synthesis and/or 
combined with sequences from other proteins, or any part thereof, to produce a variant polypeptide or 
a polypeptide having a sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by
sequencing. (See, e.g., Creighton, supra, pp. 28-53.)
In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which
contains the necessary elements for transcriptional and translational control of the inserted coding
sequence in a suitable host. These elements include regulatory sequences, such as enhancers,
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in
polynucleotide sequences encoding TRICH. Such elements may vary in their strength and specificity.

Specific initiation signals may also be used to achieve more efficient translation of sequences
encoding TRICH. Such signals include the ATG initiation codon and adjacent sequences, e.g. the
Kozak sequence. In cases where sequences encoding TRICH and its initiation codon and upstream
regulatory sequences are inserted into the appropriate expression vector, no additional transcriptional
or translational control signals may be needed. However, in cases where only coding sequence, or a
fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG
initiation codon should be provided by the vector. Exogenous translational elements and initiation
codons may be of various origins, both natural and synthetic. The efficiency of expression may be
enhanced by the inclusion of enhancers appropriate for the particular host cell system used. (See,
e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.)
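
The initiation-signal considerations above can be illustrated with a minimal sketch. It assumes a simplified reading of the Kozak context in which only two positions are scored (a purine at -3 and a G at +4, counting the A of the ATG as +1). The function names, the three-level classification, and the example sequences are hypothetical illustrations and are not part of the disclosed methods.

```python
def find_start_codon(seq: str) -> int:
    """Return the index of the first ATG in the sequence, or -1 if none is present."""
    return seq.upper().find("ATG")


def kozak_strength(seq: str, atg_index: int) -> str:
    """Classify the context around an ATG from two positions only (an assumption):
    a purine (A or G) at -3 and a G at +4, where the A of the ATG is +1."""
    s = seq.upper()
    minus3 = s[atg_index - 3] if atg_index >= 3 else ""
    plus4 = s[atg_index + 3] if atg_index + 3 < len(s) else ""
    has_purine = minus3 in ("A", "G")
    has_g = plus4 == "G"
    if has_purine and has_g:
        return "strong"
    if has_purine or has_g:
        return "adequate"
    return "weak"


def needs_vector_atg(insert: str) -> bool:
    """True when the insert does not begin with ATG, i.e. the vector would have to
    supply an in-frame initiation codon."""
    return not insert.upper().startswith("ATG")


if __name__ == "__main__":
    example = "GCCACCATGGCG"  # hypothetical sequence, not a TRICH sequence
    start = find_start_codon(example)
    print(start, kozak_strength(example, start), needs_vector_atg("GCTGCT"))
```

A check of this kind only flags whether an initiation codon is present and how favorable its immediate context looks; reading-frame continuity with any vector-supplied ATG would still need to be confirmed separately.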

Methods which are well known to those skilled in the art may be used to construct expression
vectors containing sequences encoding TRICH and appropriate transcriptional and translational
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques,
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et
al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, ch. 9, 13, and
16.)

A variety of expression vector/host systems may be utilized to contain and express sequences
encoding TRICH. These include, but are not limited to, microorganisms such as bacteria transformed
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus);
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV,
or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or
animal cell systems. (See, e.g., Sambrook, supra; Ausubel, supra; Van Heeke, G. and S.M. Schuster
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, N. (1987) EMBO
J. 6:307-311; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New
York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and
Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses,
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci.
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al.
(1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.)
The invention is not limited by the host cell employed.

In bacterial systems, a number of cloning and expression vectors may be selected depending
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine cloning,
subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved using a
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1
plasmid (Life Technologies). Ligation of sequences encoding TRICH into the vector's multiple
cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol.
Chem. 264:5503-5509.) When large quantities of TRICH are needed, e.g. for the production of
antibodies, vectors which direct high level expression of TRICH may be used. For example, vectors
containing the strong, inducible SP6 or T7 bacteriophage promoter may be used.

Yeast expression systems may be used for production of TRICH. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel,
1995, supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994)
Bio/Technology 12:181-184.)

Plant systems may also be used for expression of TRICH. Transcription of sequences
encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J.
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al.
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.)
These constructs can be introduced into plant cells by direct DNA transformation or
pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology
(1992) McGraw Hill, New York NY, pp. 191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated into
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or
EBV-based vectors may also be used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are
constructed and delivered via conventional delivery methods (liposomes, polycationic amino
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet.
15:345-355.)

For long term production of recombinant proteins in mammalian systems, stable expression
of TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed
into cell lines using expression vectors which may contain viral origins of replication and/or
endogenous expression elements and a selectable marker gene on the same or on a separate vector.
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in
enriched media before being switched to selective media. The purpose of the selectable marker is to
confer resistance to a selective agent, and its presence allows growth and recovery of cells which
successfully express the introduced sequences. Resistant clones of stably transformed cells may be
propagated using tissue culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell lines. These
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine
phosphoribosyltransferase genes, for use in tk- and apr- cells, respectively. (See, e.g., Wigler, M. et
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic,
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g.,
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981)
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc.
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins
(GFP; Clontech), β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate
luciferin, may be used. These markers can be used not only to identify transformants, but also to
quantify the amount of transient or stable protein expression attributable to a specific vector system.
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.)

Although the presence/absence of marker gene expression suggests that the gene of interest is
also present, the presence and expression of the gene may need to be confirmed. For example, if the
sequence encoding TRICH is inserted within a marker gene sequence, transformed cells containing
sequences encoding TRICH can be identified by the absence of marker gene function. Alternatively,
a marker gene can be placed in tandem with a sequence encoding TRICH under the control of a single
promoter. Expression of the marker gene in response to induction or selection usually indicates
expression of the tandem gene as well.

In general, host cells that contain the nucleic acid sequence encoding TRICH and that express
TRICH may be identified by a variety of procedures known to those of skill in the art. These
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.

Immunological methods for detecting and measuring the expression of TRICH using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing
monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a
competitive binding assay may be employed. These and other assays are well known in the art. (See,
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN,
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana
Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art and
may be used in various nucleic acid and amino acid assays. Means for producing labeled
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TRICH
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned into a vector
for the production of an mRNA probe. Such vectors are known in the art, are commercially available,
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under
conditions suitable for the expression and recovery of the protein from cell culture. The protein
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence
and/or the vector used. As will be understood by those of skill in the art, expression vectors
containing polynucleotides which encode TRICH may be designed to contain signal sequences which
direct secretion of TRICH through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation,
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity.
Different host cells which have specific cellular machinery and characteristic mechanisms for
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct
modification and processing of the foreign protein.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid
sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a
fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein
containing a heterologous moiety that can be recognized by a commercially available antibody may
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein and
peptide moieties may also facilitate purification of fusion proteins using commercially available
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST),
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG,
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and
metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a
proteolytic cleavage site located between the TRICH encoding sequence and the heterologous protein
sequence, so that TRICH may be cleaved away from the heterologous moiety following purification.
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra, ch. 10).
A variety of commercially available kits may also be used to facilitate expression and purification of
fusion proteins.
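
The pairing of fusion moieties with purification matrices described above can be summarized as a simple lookup. The following is a minimal sketch only; the dictionary contents restate the pairings given in the text, while the function name and the returned wording are hypothetical.

```python
# Fusion moieties and their cognate affinity matrices, as paired in the text above.
AFFINITY_MATRIX = {
    "GST": "immobilized glutathione",
    "MBP": "immobilized maltose",
    "Trx": "immobilized phenylarsine oxide",
    "CBP": "immobilized calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags purified with tag-specific antibodies rather than a small-molecule matrix.
EPITOPE_TAGS = {"FLAG", "c-myc", "HA"}


def purification_route(tag: str) -> str:
    """Return a one-line description of how a fusion protein bearing `tag`
    would be purified (hypothetical helper for illustration)."""
    if tag in AFFINITY_MATRIX:
        return f"affinity purification on {AFFINITY_MATRIX[tag]}"
    if tag in EPITOPE_TAGS:
        return "immunoaffinity purification with a tag-specific antibody"
    raise KeyError(f"unknown fusion tag: {tag}")


if __name__ == "__main__":
    for tag in ("GST", "6-His", "FLAG"):
        print(tag, "->", purification_route(tag))
```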

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These
systems couple transcription and translation of protein-coding sequences operably associated with the
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid
precursor, for example, 35S-methionine.

TRICH of the present invention or fragments thereof may be used to screen for compounds
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be
screened for specific binding to TRICH. Examples of test compounds include antibodies,
oligonucleotides, proteins (e.g., receptors), or small molecules.

In one embodiment, the compound thus identified is closely related to the natural ligand of
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a
natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2):
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TRICH
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the
compound can be rationally designed using known techniques. In one embodiment, screening for
these compounds involves producing appropriate cells which express TRICH, either as a secreted
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila, or
E. coli. Cells expressing TRICH or cell membrane fractions which contain TRICH are then contacted
with a test compound and binding, stimulation, or inhibition of activity of either TRICH or the
compound is analyzed.

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TRICH, either in 
solution or affixed to a solid support, and detecting the binding of TRICH to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
TRICH activity, wherein TRICH is combined with at least one test compound, and the activity of 
TRICH in the presence of a test compound is compared with the activity of TRICH in the absence of 
the test compound. A change in the activity of TRICH in the presence of the test compound is 
indicative of a compound that modulates the activity of TRICH. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising TRICH under conditions suitable for 
TRICH activity, and the assay is performed. In either of these assays, a test compound which
modulates the activity of TRICH may do so indirectly and need not come in direct contact with
TRICH. At least one and up to a plurality of test compounds may be screened.
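
The comparison described above, in which TRICH activity in the presence of a test compound is compared with activity in its absence, can be sketched as follows. The 20% threshold, the function name, and the output labels are assumptions chosen for illustration; the text does not specify how large a change must be to count as modulation.

```python
def classify_modulation(activity_with: float, activity_without: float,
                        threshold: float = 0.2) -> str:
    """Compare activity measured with and without a test compound.

    `threshold` is the minimum fractional change treated as a real effect
    (an assumed value; an actual assay would set it from replicate variability).
    """
    if activity_without <= 0:
        raise ValueError("activity without compound must be positive")
    fractional_change = (activity_with - activity_without) / activity_without
    if fractional_change >= threshold:
        return "increases activity"
    if fractional_change <= -threshold:
        return "decreases activity"
    return "no change detected"


if __name__ == "__main__":
    # Hypothetical readings in arbitrary units.
    print(classify_modulation(150.0, 100.0))
    print(classify_modulation(40.0, 100.0))
    print(classify_modulation(105.0, 100.0))
```

A result of this kind indicates only that activity changed; it does not distinguish direct from indirect effects of the test compound.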

In another embodiment, polynucleotides encoding TRICH or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent Number 5,175,383 and U.S. Patent Number 
5,767,337.) For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the 
early mouse embryo and grown in culture. The ES cells are transformed with a vector containing the 
gene of interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; 
Capecchi, M.R. (1989) Science 244:1288-1292). The vector integrates into the corresponding region
of the host genome by homologous recombination. Alternatively, homologous recombination takes
place using the Cre-loxP system to knockout a gene of interest in a tissue- or developmental
stage-specific manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic
Acids Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell
blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential
therapeutic or toxic agents.

Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al.
(1998) Science 282:1145-1147).

Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected
sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

THERAPEUTICS

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists
between regions of TRICH and transporters and ion channels. In addition, the expression of TRICH
is closely associated with brain, liver, tumor, colon, thymus, small intestine, myometrium, testicular,
bone marrow neuroblastoma tumor, parotid gland, lung, pituitary gland, and placental tissues, and
Pompe's disease. Therefore, TRICH appears to play a role in transport, neurological, muscle,
immunological, and cell proliferative disorders. In the treatment of disorders associated with
increased TRICH expression or activity, it is desirable to decrease the expression or activity of
TRICH. In the treatment of disorders associated with decreased TRICH expression or activity, it is
desirable to increase the expression or activity of TRICH.

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's
muscular dystrophy, Bell's palsy, Charcot-Marie-Tooth disease, diabetes mellitus, diabetes insipidus,
diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic
periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia
gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral
neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrhythmia,
tachyarrhythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline
myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy,
ethanol myopathy, dermatomyositis, inclusion body myositis, infectious myositis, polymyositis,
neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other 
disorders associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal
neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Graves' disease, goiter,
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome,
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn
syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartnup disease, and Fanconi disease; a
neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms,
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous
system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis,
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis,
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental
disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD),
akathisia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses,
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal
degeneration, and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy,
myocarditis, Duchenne's muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy,
central core disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial
myopathy, infectious myositis, polymyositis, dermatomyositis, inclusion body myositis, thyrotoxic
myopathy, ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, 
Cushing's syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, 
pheochromocytoma, and myopathies including encephalopathy, epilepsy, Kearns-Sayre syndrome, 
lactic acidosis, myoclonic disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known
as Pompe's disease); an immunological disorder such as acquired immunodeficiency syndrome
(AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis,
amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED),
bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis,
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves'
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis,
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis,
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome,
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura,
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and
trauma; and a cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis,
bursitis, cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal
nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers
including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma,
and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid,
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus.

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent
a disorder associated with decreased expression or activity of TRICH including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of TRICH may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH including, but not limited to, those listed above.

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat or
prevent a disorder associated with increased expression or activity of TRICH. Examples of such
disorders include, but are not limited to, those transport, neurological, muscle, immunological, and
cell proliferative disorders described above. In one aspect, an antibody which specifically binds
TRICH may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for
bringing a pharmaceutical agent to cells or tissues which express TRICH.

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TRICH including, but not limited to, those described above.

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TRICH may be produced using methods which are generally known in the 
art. In particular, purified TRICH may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may
also be generated using methods that are well known in the art. Such antibodies may include, but are
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans,
and others may be immunized by injection with TRICH or with any fragment or oligopeptide thereof
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic
polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in
humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches
of TRICH amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to TRICH may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J.
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda,
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single
chain antibodies may be adapted, using methods known in the art, to produce TRICH-specific single
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g.,
Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.)

Antibodies may also be produced by inducing in vivo production in the lymphocyte
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.)

Antibody fragments which contain specific binding sites for TRICH may also be generated.
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D.
et al. (1989) Science 246:1275-1281.)

Various immunoassays may be used for screening to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such
immunoassays typically involve the measurement of complex formation between TRICH and its
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies
reactive to two non-interfering TRICH epitopes is generally used, but a competitive binding assay
may also be employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay
techniques may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an
association constant, Ka, which is defined as the molar concentration of TRICH-antibody complex
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions.
The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their
affinities for multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies
for TRICH. The Ka determined for a preparation of monoclonal antibodies, which are monospecific
for a particular TRICH epitope, represents a true measure of affinity. High-affinity antibody
preparations with Ka ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays in
which the TRICH-antibody complex must withstand rigorous manipulations. Low-affinity antibody
preparations with Ka ranging from about 10^6 to 10^7 L/mole are preferred for use in
immunopurification and similar procedures which ultimately require dissociation of TRICH,
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical
Approach, IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to
Monoclonal Antibodies, John Wiley & Sons, New York NY).
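
By way of illustration only, the association constant defined above can be computed from equilibrium concentrations as in the following minimal Python sketch. The function names, the example concentrations, and the numeric band limits (taken here as 10^9 to 10^12 L/mole for high affinity and 10^6 to 10^7 L/mole for low affinity) are assumptions made for the example and are not measured values.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Return Ka (L/mole) = [complex] / ([free antigen] * [free antibody]).

    All inputs are equilibrium molar concentrations (mol/L).
    """
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def affinity_band(ka):
    """Classify Ka using the assumed band limits stated in the lead-in."""
    if 1e9 <= ka <= 1e12:
        return "high"          # suited to immunoassays
    if 1e6 <= ka <= 1e7:
        return "low"           # suited to immunopurification
    return "intermediate"


# Hypothetical equilibrium concentrations, for illustration only.
ka_example = association_constant(5e-9, 1e-9, 1e-9)
print(affinity_band(ka_example))
```

For a polyclonal preparation the same calculation would yield an average value (avidity) rather than a single-epitope affinity.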

The titer and avidity of polyclonal antibody preparations may be further evaluated to
determine the quality and suitability of such preparations for certain downstream applications. For
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml,
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation
of TRICH-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and
guidelines for antibody quality and usage in various applications, are generally available. (See, e.g.,
Catty, supra, and Coligan et al. supra.)

In another embodiment of the invention, the polynucleotides encoding TRICH, or any
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, modifications
of gene expression can be achieved by designing complementary sequences or antisense molecules
(DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene
encoding TRICH. Such technology is well known in the art, and antisense oligonucleotides or larger
fragments can be designed from various locations along the coding or control regions of sequences
encoding TRICH. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press Inc.,
Totowa NJ.)
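
As a non-limiting illustration of designing a complementary sequence, the following Python sketch derives a DNA antisense oligonucleotide as the reverse complement of a chosen mRNA region. The function name and the six-base example region are hypothetical and are not taken from any sequence of the invention.

```python
# Maps each RNA base of the target to the complementary DNA base.
RNA_TO_ANTISENSE_DNA = {"A": "T", "C": "G", "G": "C", "U": "A"}


def antisense_dna(mrna_region):
    """Return the DNA antisense oligo (written 5'->3') for an mRNA region (5'->3')."""
    seq = mrna_region.upper()
    unknown = set(seq) - set(RNA_TO_ANTISENSE_DNA)
    if unknown:
        raise ValueError("unexpected bases: " + "".join(sorted(unknown)))
    # Reverse the target, then complement each base.
    return "".join(RNA_TO_ANTISENSE_DNA[base] for base in reversed(seq))


# Hypothetical target region beginning at a start codon.
print(antisense_dna("AUGGCC"))
```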

In therapeutic use, any gene delivery system suitable for introduction of the antisense
sequences into appropriate target cells can be used. Antisense sequences can be delivered
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g.,
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995)
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood
76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(1):217-225; Boado, R.J. et
al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res.
25(14):2730-2736.)

In another embodiment of the invention, polynucleotides encoding TRICH may be used for
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475),
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal,
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii)
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D.
(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399),
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In the
case where a genetic deficiency in TRICH expression or regulation causes disease, the expression of
TRICH from an appropriate population of transduced cells may alleviate the clinical manifestations
caused by the genetic deficiency.

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii)
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev.
Biochem. 62:191-217; Ivics, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Récipon (1998) Curr.
Opin. Biotechnol. 9:445-450).

Expression vectors that may be effective for the expression of TRICH include, but are not
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA),
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TRICH
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV),
Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible
promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl.
Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and
H.M. Blau (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid
(Invitrogen)); the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND;
Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter
(Rossi, F.M.V. and Blau, H.M. supra), or (iii) a tissue-specific promoter or the native promoter of the
endogenous gene encoding TRICH from a normal individual.

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental
parameters. In the alternative, transformation is performed using the calcium phosphate method
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al.
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with
respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc.
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al.
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R.
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant")
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4+ T-cells),
and the return of transduced cells to a patient are procedures well known to persons skilled in
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716;
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-
2290).

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with respect
to the expression of TRICH. The construction and packaging of adenovirus-based vectors are well
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) 
Annu. Rev. Nutr. 19:511-544 and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with
respect to the expression of TRICH. The use of herpes simplex virus (HSV)-based vectors may be
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV has
a tropism. The construction and packaging of herpes-based vectors are well known to those with
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res.
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S.
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant
HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a
cell under the control of the appropriate promoter for purposes including human gene therapy. Also
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27
and ICP22. For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al.
(1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned
herpesvirus sequences, the generation of recombinant virus following the transfection of multiple
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of
ordinary skill in the art.

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus,
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA,
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity
(e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into the
alphavirus genome in place of the capsid-coding region results in the production of a large number of
TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced cells. While
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN)
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy
application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will
allow the introduction of TRICH into a variety of cell types. The specific transduction of a subset of
cells in a population may require the sorting of cells prior to transduction. The methods of
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA
transfections, and performing alphavirus infections, are well known to those with ordinary skill in the
art.

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly,
inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful
because it causes inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. 
and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177.)
A complementary sequence or antisense molecule may also be designed to block translation of
mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example,
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze
endonucleolytic cleavage of sequences encoding TRICH. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides,
corresponding to the region of the target gene containing the cleavage site, may be evaluated for
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 
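
The initial scan described above can be sketched in Python as follows. This is an illustrative sketch only: the function names, the default window length of 17 ribonucleotides (chosen from within the 15 to 20 range), and the example sequences are assumptions, and the sketch does not evaluate secondary structure or accessibility.

```python
import re

# Candidate cleavage triplets named in the text: GUA, GUU, and GUC.
CLEAVAGE_SITE = re.compile(r"GU[AUC]")


def find_cleavage_sites(rna):
    """Return (0-based position, triplet) for each candidate site in an RNA string."""
    rna = rna.upper().replace("T", "U")  # accept cDNA-style input as well
    return [(m.start(), m.group()) for m in CLEAVAGE_SITE.finditer(rna)]


def candidate_window(rna, position, length=17):
    """Return a short region of the target, roughly centered on the triplet."""
    rna = rna.upper().replace("T", "U")
    start = max(0, position + 1 - length // 2)
    start = min(start, max(0, len(rna) - length))  # keep the window inside the target
    return rna[start:start + length]


# Hypothetical target fragment, for illustration only.
target = "AAGUCAAGUGAAGUUA"
for pos, triplet in find_cleavage_sites(target):
    print(pos, triplet, candidate_window(target, pos))
```

Each window returned in this way would still need to be assessed for secondary structural features and for accessibility to hybridization before being treated as a usable target.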

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared
by any method known in the art for the synthesis of nucleic acid molecules. These include techniques
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis.
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA
sequences encoding TRICH. Such DNA sequences may be incorporated into a wide variety of
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA
constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into
cell lines, cells, or tissues.

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a
compound which is effective in altering expression of a polynucleotide encoding TRICH.
Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and
non-macromolecular chemical entities which are capable of interacting with specific polynucleotide
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
TRICH expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders 
associated with decreased TRICH expression or activity, a compound which specifically promotes
expression of the polynucleotide encoding TRICH may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted
biochemical system. Alterations in the expression of a polynucleotide encoding TRICH are assayed
by any method commonly known in the art. Typically, the expression of a specific nucleotide is
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus
forming the basis for a comparison of the expression of the polynucleotide both with and without
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomyces pombe gene expression
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids
Res. 28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys.
Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S.
Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells
taken from the patient and clonally propagated for autologous transplant back into that same patient.
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat.
Biotechnol. 15:462-466.)

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys.

An additional embodiment of the invention relates to the administration of a composition 
which generally comprises an active ingredient formulated with a pharmaceutically acceptable
excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins.
Various formulations are commonly known and are thoroughly discussed in the latest edition of
Remington's Pharmaceutical Sciences (Mack Publishing, Easton PA). Such compositions may
consist of TRICH, antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of TRICH.

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal,
enteral, topical, sublingual, or rectal means.

Compositions for pulmonary administration may be prepared in liquid or dry powder form.
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides
and proteins), recent developments in the field of pulmonary delivery via the alveolar region of the
lung have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton,
J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration
without needle injection, and obviates the need for potentially toxic penetration enhancers.

Compositions suitable for use in the invention include compositions wherein the active
ingredients are contained in an effective amount to achieve the intended purpose. The determination
of an effective dose is well within the capability of those skilled in the art.

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of
the macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic
N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs,
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration
range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example
TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of TRICH,
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined
by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the
therapeutic index, which can be expressed as the LD50/ED50 ratio. Compositions which exhibit large
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are
used to formulate a range of dosage for human use. The dosage contained in such compositions is
preferably within a range of circulating concentrations that includes the ED50 with little or no toxicity.
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the
patient, and the route of administration.
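
As a simple worked illustration of the therapeutic index described above, the following Python sketch computes the ratio of the dose lethal to 50% of the population to the dose therapeutically effective in 50% of the population. The function name and the dose values are hypothetical and do not represent data for any composition of the invention.

```python
def therapeutic_index(ld50, ed50):
    """Return the LD50/ED50 ratio; both doses must be in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("doses must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, for illustration only.
print(therapeutic_index(ld50=500.0, ed50=25.0))  # 500 / 25 = 20.0
```

A larger ratio indicates a wider margin between the effective and the lethal dose, which is why compositions with large therapeutic indices are preferred.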

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and
response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week,
or biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TRICH may be used for the
diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for
diagnostic purposes may be prepared in the same manner as described above for therapeutics.
Diagnostic assays for TRICH include methods which utilize the antibody and a label to detect TRICH
in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without
modification, and may be labeled by covalent or non-covalent attachment of a reporter molecule. A 
wide variety of reporter molecules, several of which are described above, are known in the art and 
may be used. 

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. Normal
or standard values for TRICH expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to TRICH 
under conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TRICH expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values.
Deviation between standard and subject values establishes the parameters for diagnosing disease. 
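
One way to express the deviation between standard and subject values is as a z-score against measurements from normal subjects. The sketch below is an assumption for illustration: the specification does not prescribe a statistic, and the sample values and the cutoff of 2.0 are hypothetical.

```python
from statistics import mean, stdev


def deviation_from_standard(subject_value, normal_values):
    """Return the z-score of a subject value relative to normal-subject values."""
    mu = mean(normal_values)
    sigma = stdev(normal_values)  # sample standard deviation
    if sigma == 0:
        raise ValueError("normal values show no variation")
    return (subject_value - mu) / sigma


# Hypothetical photometric readings (arbitrary units) from normal subjects.
normal = [1.0, 1.2, 0.9, 1.1, 0.8]
z = deviation_from_standard(2.0, normal)
flagged = abs(z) > 2.0  # hypothetical cutoff for "altered" expression
print(f"z = {z:.2f}, flagged = {flagged}")
```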

In another embodiment of the invention, the polynucleotides encoding TRICH may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of TRICH may be correlated
with disease. The diagnostic assay may be used to determine absence, presence, and excess
expression of TRICH, and to monitor regulation of TRICH levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding TRICH or closely related molecules may be used 
to identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether it is
made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding TRICH, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50%
sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:31-60 or from 
genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DNAs encoding TRICH include the
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the
production of mRNA probes. Such vectors are known in the art, are commercially available, and may
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a
variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels,
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie-Tooth disease, diabetes mellitus, diabetes
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis,
normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance,
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g.,
angina, bradyarrhythmia, tachyarrhythmia, hypertension, Long QT syndrome, myocarditis,
cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 
infectious myositis, polymyositis, neurological disorders associated with transport, e.g., Alzheimer's 
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
psychoses, and schizophrenia, and other disorders associated with transport, e.g., neurofibromatosis,
postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease,
cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia,
hypoglycemia, Graves' disease, goiter, Cushing's disease, Addison's disease, glucose-galactose
malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome,
Menkes disease, occipital horn syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartnup
disease, and Fanconi disease; a neurological disorder such as epilepsy, ischemic cerebrovascular
disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease,
dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and
other motor neuron disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary 
ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain
abscess, subdural empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and
radiculitis, viral central nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob 
disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and 
metabolic diseases of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal 
hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation and other developmental
disorders of the central nervous system including Down syndrome, cerebral palsy, neuroskeletal
disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, 
muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, 
myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic
disorders, seasonal affective disorder (SAD), akathisia, amnesia, catatonia, diabetic neuropathy,
tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, 
progressive supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a 
muscle disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's 
muscular dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear
myopathy, lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis,
dermatomyositis, inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina,
anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension,
hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and myopathies including 
encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder,
ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); an
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease,
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma,
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema,
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis,
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis,
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis,
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid,
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. The polynucleotide 
sequences encoding TRICH may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered TRICH
expression. Such qualitative or quantitative methods are well known in the art.

In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a
suitable incubation period, the sample is washed and the signal is quantified and compared with a
standard value. If the amount of signal in the patient sample is significantly altered in comparison to 
a control sample then the presence of altered levels of nucleotide sequences encoding TRICH in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to
monitor the treatment of an individual patient.

In order to provide a basis for the diagnosis of a disorder associated with expression of
TRICH, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or
amplification. Standard hybridization may be quantified by comparing the values obtained from
normal subjects with values from an experiment in which a known amount of a substantially purified
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder.

Once the presence of a disorder is established and a treatment protocol is initiated,
hybridization assays may be repeated on a regular basis to determine if the level of expression in the
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier, thereby preventing the development
or further progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
TRICH may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide
encoding TRICH, or a fragment of a polynucleotide complementary to the polynucleotide encoding
TRICH, and will be employed under optimized conditions for identification of a specific gene or
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic
disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 
oligonucleotide primers derived from the polynucleotide sequences encoding TRICH are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example,
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause
differences in the secondary and tertiary structures of PCR products in single-stranded form, and
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSSCP, the
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in
high-throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the
sequence of individual overlapping DNA fragments which assemble into a common consensus
sequence. These computer-based methods filter out sequence variations due to laboratory preparation
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence
chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA).
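
The in silico comparison of overlapping fragments against a consensus can be sketched as follows. This is a toy illustration under stated assumptions, not the isSNP method itself: the reads are assumed to be already aligned to equal length (with "-" marking uncovered positions), and a simple minimum read count (`min_support`, a hypothetical parameter) stands in for the statistical models and chromatogram analysis used to filter sequencing errors.

```python
from collections import Counter


def candidate_snps(aligned_reads, min_support=2):
    """Return (position, consensus_base, variant_base, count) for each column
    where a non-consensus base is seen in at least min_support reads."""
    if not aligned_reads:
        return []
    length = len(aligned_reads[0])
    if any(len(r) != length for r in aligned_reads):
        raise ValueError("reads must be pre-aligned to equal length")
    results = []
    for pos in range(length):
        # Ignore reads that do not cover this position.
        column = [r[pos] for r in aligned_reads if r[pos] != "-"]
        if not column:
            continue
        counts = Counter(column)
        consensus, _ = counts.most_common(1)[0]
        for base, n in sorted(counts.items()):
            if base != consensus and n >= min_support:
                results.append((pos, consensus, base, n))
    return results


# Made-up reads: position 3 differs in two reads, position 4 in only one.
reads = [
    "ACGTACGT",
    "ACGTACGT",
    "ACGAACGT",
    "ACGAACGT",
    "ACGTACGT",
    "ACGTCCGT",
]
print(candidate_snps(reads))
```

With these reads, the difference at position 3 is reported as a candidate because two reads support it, while the single-read difference at position 4 is discarded as a likely error.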

Methods which may also be used to quantify the expression of TRICH include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 
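
Interpolating a result from a standard curve can be sketched as piecewise-linear interpolation between bracketing standards. The function name, the dilution quantities, and the absorbance readings below are hypothetical, and the sketch assumes the signal increases monotonically with quantity.

```python
def interpolate_from_standard_curve(signal, standards):
    """Estimate quantity by linear interpolation between bracketing standards.

    standards: list of (known_quantity, measured_signal) pairs.
    """
    pts = sorted(standards, key=lambda p: p[1])
    for (q0, s0), (q1, s1) in zip(pts, pts[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return q0
            return q0 + (signal - s0) * (q1 - q0) / (s1 - s0)
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical dilution series: (quantity, colorimetric absorbance).
standards = [(0.0, 0.05), (10.0, 0.25), (20.0, 0.45), (40.0, 0.85)]
estimate = interpolate_from_standard_curve(0.35, standards)
print(f"estimated quantity = {estimate:.2f}")
```

Signals outside the range of the standards raise an error rather than being extrapolated, since the curve is not characterized there.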

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray
can be used in transcript imaging techniques which monitor the relative expression levels of large
numbers of genes simultaneously as described below. The microarray may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the
activities of therapeutic agents in the treatment of disease. In particular, this information may be used
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and
effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH may
be used as elements on a microarray. The microarray may be used to monitor or measure
protein-protein interactions, drug-target interactions, and gene expression profiles, as described above.

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by
quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the
hybridization takes place in high-throughput format, wherein the polynucleotides of the present
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines,
biopsies, or other biological samples. The transcript image may thus reflect gene expression in vivo,
as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line.

Transcript images which profile the expression of the polynucleotides of the present
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental
compounds. All compounds induce characteristic gene expression patterns, frequently termed
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson
(2000) Toxicol. Lett. 112-113:467-471, expressly incorporated by reference herein). If a test
compound has a signature similar to that of a compound with known toxicity, it is likely to share
those toxic properties. These fingerprints or signatures are most useful and refined when they contain
expression information from a large number of genes and gene families. Ideally, a genome-wide 
measurement of expression provides the highest quality signature. Even genes whose expression is 
not altered by any tested compounds are important as well, as the levels of expression of these genes 
are used to normalize the rest of the expression data. The normalization procedure is useful for
comparison of expression data after treatment with different compounds. While the assignment of
gene function to elements of a toxicant signature aids in interpretation of toxicity mechanisms, 
knowledge of gene function is not necessary for the statistical matching of signatures which leads to 
prediction of toxicity. (See, for example, Press Release 00-02 from the National Institute of 
Environmental Health Sciences, released February 29, 2000, available at
http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in
toxicological screening using toxicant signatures to include all expressed gene sequences. 
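
The normalization and statistical matching of toxicant signatures described above can be sketched as follows. Everything in the sketch is an assumption for illustration: expression is normalized to the mean level of genes taken to be unaltered, a signature is the per-gene log2 ratio of treated to untreated expression, and signatures are matched by Pearson correlation. The gene names and expression values are made up; the specification does not prescribe these particular statistics.

```python
import math


def normalize(expr, reference_genes):
    """Divide each expression value by the mean level of the reference genes."""
    scale = sum(expr[g] for g in reference_genes) / len(reference_genes)
    return {g: v / scale for g, v in expr.items()}


def signature(treated, untreated, reference_genes):
    """Log2 ratio of normalized treated to untreated expression per gene."""
    t = normalize(treated, reference_genes)
    u = normalize(untreated, reference_genes)
    return {g: math.log2(t[g] / u[g]) for g in t if g not in reference_genes}


def pearson(sig_a, sig_b):
    """Pearson correlation between two signatures over their shared genes."""
    genes = sorted(set(sig_a) & set(sig_b))
    xs = [sig_a[g] for g in genes]
    ys = [sig_b[g] for g in genes]
    n = len(genes)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("a signature with no variation cannot be correlated")
    return cov / math.sqrt(vx * vy)


refs = ["ref1", "ref2"]  # hypothetical genes assumed unaltered by treatment
untreated = {"geneA": 100, "geneB": 50, "geneC": 80, "ref1": 200, "ref2": 100}
known_toxicant = {"geneA": 400, "geneB": 25, "geneC": 80, "ref1": 200, "ref2": 100}
test_compound = {"geneA": 600, "geneB": 30, "geneC": 120, "ref1": 300, "ref2": 150}

r = pearson(signature(known_toxicant, untreated, refs),
            signature(test_compound, untreated, refs))
print(f"signature correlation = {r:.3f}")
```

In this made-up data the test compound sample has a higher overall signal, which the reference-gene normalization removes before the signatures are compared.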

In one embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of
the present invention, so that transcript levels corresponding to the polynucleotides of the present
invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global
pattern of protein expression in a particular tissue or cell type. Each protein component of a
proteome can be subjected individually to further analysis. Proteome expression patterns, or profiles, 
are analyzed by quantifying the number of expressed proteins and their relative abundance under 
given conditions and at a given time. A profile of a cell's proteome may thus be generated by
separating and analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the
separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample are 
separated by isoelectric focusing in the first dimension, and then according to molecular weight by 
sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson,
supra). The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by
staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical
density of each protein spot is generally proportional to the level of the protein in the sample. The
optical densities of equivalently positioned protein spots from different samples, for example, from
biological samples either treated or untreated with a test compound or therapeutic agent, are
compared to identify any changes in protein spot density related to the treatment. The proteins in the
spots are partially sequenced using, for example, standard methods employing chemical or enzymatic
cleavage followed by mass spectrometry. The identity of the protein in a spot may be determined by
comparing its partial sequence, preferably of at least 5 contiguous amino acid residues, to the
polypeptide sequences of the present invention. In some cases, further sequence data may be
obtained for definitive protein identification. 
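
Comparing a partial sequence of at least 5 contiguous residues to a set of polypeptide sequences can be sketched as an exact substring search. The candidate names and sequences below are made up for illustration and are not the sequences of the invention; a practical comparison would typically also tolerate sequencing ambiguities, which this sketch does not attempt.

```python
def match_partial_sequence(partial, candidates, min_len=5):
    """Return names of candidate polypeptides that contain the partial sequence."""
    partial = partial.upper()
    if len(partial) < min_len:
        raise ValueError(f"partial sequence must have at least {min_len} residues")
    return [name for name, seq in candidates.items() if partial in seq.upper()]


# Hypothetical one-letter amino acid sequences.
candidates = {
    "polypeptide_1": "MKTLLVAGGSEQ",
    "polypeptide_2": "MSDNELLKQAA",
}
print(match_partial_sequence("LLVAG", candidates))
```

A partial sequence that matches more than one candidate is the kind of case in which further sequence data would be needed for definitive identification.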

A proteomic profile may also be generated using antibodies specific for TRICH to quantify
the levels of TRICH expression. In one embodiment, the antibodies are used as elements on a
microarray, and protein expression levels are quantified by exposing the microarray to the sample and
detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem.
270:103-111; Mendoza, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed
by a variety of methods known in the art, for example, by reacting the proteins in the sample with a
thiol- or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at
each array element.

Toxicant signatures at the proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor
correlation between transcript and protein abundances for some proteins in some tissues (Anderson,
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 
useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to 
rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in such
cases.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological
sample. A difference in the amount of protein between the two samples is indicative of a toxic
response to the test compound in the treated sample. Individual proteins are identified by sequencing
the amino acid residues of the individual proteins and comparing these partial sequences to the 
polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins from the biological sample are
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated
sample.

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g.,
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al.
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-2155;
and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are
well known and thoroughly described in DNA Microarrays: A Practical Approach, M. Schena, ed.
(1999) Oxford University Press, London, hereby expressly incorporated by reference.

In another embodiment of the invention, nucleic acid sequences encoding TRICH may be
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence.
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may
be preferable over coding sequences. For example, conservation of a coding sequence among
members of a multi-gene family may potentially cause undesired cross hybridization during
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes
(BACs), bacterial P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J.
et al. (1997) Nat. Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J.
(1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be
used to develop genetic linkage maps, for example, which correlate the inheritance of a disease state
with the inheritance of a particular chromosome region or restriction fragment length polymorphism
(RFLP). (See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of genetic
map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man
(OMIM) World Wide Web site. Correlation between the location of the gene encoding TRICH on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as
linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23,
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT
application WO84/03564.) In this method, large numbers of different small test compounds are
synthesized on a solid substrate. The test compounds are reacted with TRICH, or fragments thereof,
and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can
also be coated directly onto plates for use in the aforementioned drug screening techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a
solid support.

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding TRICH specifically compete with a test compound for binding TRICH. 
In this manner, antibodies can be used to detect the presence of any peptide which shares one or more
antigenic determinants with TRICH.

In additional embodiments, the nucleotide sequences which encode TRICH may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions.

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following embodiments are,
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below,
including U.S. Ser. No. 60/223,269, U.S. Ser. No. 60/224,456, U.S. Ser. No. 60/226,410, U.S. Ser.
No. 60/228,140, U.S. Ser. No. 60/230,067, and U.S. Ser. No. 60/231,434, are hereby expressly 
incorporated by reference. 

EXAMPLES
I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD database
(Incyte Genomics, Palo Alto CA) and shown in Table 4, column 5. Some tissues were homogenized
and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a
suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of
phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or
extracted with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium
acetate and ethanol, or by other routine methods.

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected
(300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g.,
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 plasmid
(Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (Incyte Genomics, Palo Alto
CA), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells
including XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or ElectroMAX
DH10B from Life Technologies.

II. Isolation of cDNA Clones

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid,
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96
plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1
ml of distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows.
Sequencing reactions were processed using standard methods or high-throughput instrumentation
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI
protocols and base calling software; or other sequence analysis systems known in the art. Reading
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel,
1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques
disclosed in Example VIII.

The polynucleotide sequences derived from Incyte cDNAs were validated by removing
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The
Incyte cDNA sequences or translations thereof were then queried against a selection of public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and
BLOCKS, PRINTS, DOMO, PRODOM, and hidden Markov model (HMM)-based protein family
databases such as PFAM. (HMM is a probabilistic approach which analyzes consensus primary
structures of gene families. See, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.)
The queries were performed using programs based on BLAST, FASTA, BLIMPS, and HMMER. The
Incyte cDNA sequences were assembled to produce full length polynucleotide sequences.
Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or
Genscan-predicted coding sequences (see Examples IV and V) were used to extend Incyte cDNA
assemblages to full length. Assembly was performed using programs based on Phred, Phrap, and
Consed, and cDNA assemblages were screened for open reading frames using programs based on
GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive
the corresponding full length polypeptide sequences. Alternatively, a polypeptide of the invention
may begin at any of the methionine residues of the full length translated polypeptide. Full length
polypeptide sequences were subsequently analyzed by querying against databases such as the
GenBank protein databases (genpept), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite,
and hidden Markov model (HMM)-based protein family databases such as PFAM. Full length
polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide
and polypeptide sequence alignments are generated using default parameters specified by the
CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program
(DNASTAR), which also calculates the percent identity between aligned sequences.

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of
Incyte cDNA and full length sequences and provides applicable descriptions, references, and
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used,
the second column provides brief descriptions thereof, the third column presents appropriate
references, all of which are incorporated by reference herein in their entirety, and the fourth column
presents, where applicable, the scores, probability values, and other parameters used to evaluate the
strength of a match between two sequences (the higher the score or the lower the probability value,
the greater the identity between two sequences).

The programs described above for the assembly and analysis of full length polynucleotide 
and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:31-60. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies are described in Table 4, column 4.

IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative transporters and ion channels were initially identified by running the Genscan gene
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is
a general-purpose gene identification program which analyzes genomic DNA sequences from a
variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and
S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of
Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range of
sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan
predicted cDNA sequences encode transporters and ion channels, the encoded polypeptides were
analyzed by querying against PFAM models for transporters and ion channels. Potential transporters
and ion channels were also identified by homology to Incyte cDNA sequences that had been
annotated as transporters and ion channels. These selected Genscan-predicted sequences were then
compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the
Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST analysis
was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted
sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available,
this information was used to correct or confirm the Genscan predicted sequence. Full length
polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with
Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in
Example III. Alternatively, full length polynucleotide sequences were derived entirely from edited or
unedited Genscan-predicted coding sequences.

V. Assembly of Genomic Sequence Data with cDNA Sequence Data

"Stitched" Sequences

Partial cDNA sequences were extended with exons predicted by the Genscan gene
identification program described in Example IV. Partial cDNAs assembled as described in Example
III were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm
based on graph theory and dynamic programming to integrate cDNA and genomic information,
generating possible splice variants that were subsequently confirmed, edited, or extended to create a
full length sequence. Sequence intervals in which the entire length of the interval was present on
more than one sequence in the cluster were identified, and intervals thus identified were considered to
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic
sequences, then all three intervals were considered to be equivalent. This process allows unrelated
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals
thus identified were then "stitched" together by the stitching algorithm in the order that they appear
along their parent sequences to generate the longest possible sequence, as well as sequence variants.
Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or
genomic sequence to genomic sequence) were given preference over linkages which change parent
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan
were corrected by comparison to the top BLAST hit from genpept. Sequences were further extended
with additional cDNA sequences, or by inspection of genomic DNA, when necessary.
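
The grouping of intervals "by transitivity" described above can be illustrated with a union-find structure. This is only a minimal sketch under stated assumptions: the function name group_equivalent_intervals and the interval labels are hypothetical, and the graph-theory and dynamic-programming algorithm actually used is not disclosed in this text.

```python
def group_equivalent_intervals(pairs):
    """Group interval labels into equivalence classes by transitivity.

    `pairs` is an iterable of (a, b) tuples, each stating that interval a
    and interval b cover the same stretch of sequence. Labels are
    hypothetical identifiers chosen for illustration only.
    """
    parent = {}

    def find(x):
        # Register unseen labels as their own root, then walk to the root
        # with path halving to keep later lookups short.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = {}
    for label in list(parent):
        groups.setdefault(find(label), set()).add(label)
    return list(groups.values())


# One cDNA interval matches two genomic intervals, so all three are equivalent.
example = group_equivalent_intervals([
    ("cDNA_1", "genomic_A"),
    ("cDNA_1", "genomic_B"),
    ("cDNA_2", "genomic_C"),
])
```

Here the two genomic intervals never match each other directly; they fall into the same group only because both match the same cDNA interval, which is the bridging behavior the paragraph describes.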

"Stretched" Sequences

Partial DNA sequences were extended to full length with an algorithm based on BLAST
analysis. First, partial cDNAs assembled as described in Example III were queried against public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions
may occur in the chimeric protein with respect to the original GenBank protein homolog. The
GenBank protein homolog, the chimeric protein, or both were used as probes to search for
homologous genomic sequences from the public human genome databases. Partial DNA sequences
were therefore "stretched" or extended by the addition of homologous genomic sequences. The
resultant stretched sequences were examined to determine whether they contained a complete gene.
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:31-60 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:31-60 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Généthon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.

Map locations are represented by ranges, or intervals, of human chromosomes. The map
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in
humans, although this can vary widely due to hot and cold spots of recombination.) The cM
distances are based on genetic markers mapped by Généthon which provide boundaries for radiation
hybrid markers whose sequences were included in each of the clusters. Human genome maps and
other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified
disease genes map within or in proximity to the intervals indicated above.
VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel 
(1995) supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the
computer search can be modified to determine whether any particular match is categorized as exact or
similar. The basis of the search is the product score, which is defined as:

\[
\frac{\text{BLAST Score} \times \text{Percent Identity}}{5 \times \min\{\text{length(Seq. 1)},\ \text{length(Seq. 2)}\}}
\]

The product score takes into account both the degree of similarity between two sequences and the
length of the sequence match. The product score is a normalized value between 0 and 100, and is
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate
the product score. The product score represents a balance between fractional overlap and quality in a
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the
entire length of the shorter of the two sequences being compared. A product score of 70 is produced
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79%
identity and 100% overlap.
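
The product score calculation described above can be sketched in a few lines. This is an illustrative sketch, not the software actually used: the function names are hypothetical, a single ungapped HSP is assumed, and percent identity is assumed to be computed over the aligned positions of that HSP.

```python
def blast_score(matches: int, mismatches: int) -> int:
    """Score one HSP with +5 per matching base and -4 per mismatch."""
    return 5 * matches - 4 * mismatches


def product_score(matches: int, mismatches: int,
                  len_seq1: int, len_seq2: int) -> float:
    """Product score for the best HSP between two sequences.

    Assumes an ungapped HSP, with percent identity taken over the
    aligned positions (matches + mismatches).
    """
    aligned = matches + mismatches
    if aligned == 0:
        return 0.0
    percent_identity = 100.0 * matches / aligned
    shorter = min(len_seq1, len_seq2)
    return blast_score(matches, mismatches) * percent_identity / (5 * shorter)


# 100% identity over 70% of a 100-base sequence.
partial_overlap = product_score(70, 0, 100, 150)
# 88% identity over the full length of a 100-base sequence.
full_overlap = product_score(88, 12, 100, 100)
```

Under these assumptions the first case evaluates to exactly 70 and the second to about 69, consistent with the two routes to a product score of 70 given in the text.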

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the
tissue sources from which they were derived. For example, some full length sequences are
assembled, at least in part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is
classified into one of the following organ/tissue categories: cardiovascular system; connective tissue;
digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female;
genitalia, male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous
system; pancreas; respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed;
or urinary tract. The number of libraries in each category is counted and divided by the total number
of libraries across all categories. Similarly, each human tissue is classified into one of the following
disease/condition categories: cancer, cell line, developmental, inflammation, neurological, trauma,
cardiovascular, pooled, and other, and the number of libraries in each category is counted and divided
by the total number of libraries across all categories. The resulting percentages reflect the tissue- and
disease-specific expression of cDNA encoding TRICH. cDNA sequences and cDNA library/tissue
information are found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA).
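
The category percentages described above amount to a count per category divided by the total number of libraries. A minimal sketch follows; the function name and the example library list are hypothetical and are not data from the LIFESEQ GOLD database.

```python
from collections import Counter


def category_percentages(library_categories):
    """Return the percentage of libraries falling in each category.

    `library_categories` holds one category label per cDNA library,
    e.g. an organ/tissue or disease/condition category.
    """
    counts = Counter(library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: 100.0 * n / total for category, n in counts.items()}


# Hypothetical example: four libraries contributing cDNAs to one sequence.
example = category_percentages(
    ["nervous system", "nervous system", "liver", "skin"]
)
```

The same function applies unchanged to the disease/condition classification, since only the labels differ.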

VIII. Extension of TRICH Encoding Polynucleotides

Full length polynucleotide sequences were also produced by extension of an appropriate
fragment of the full length molecule using oligonucleotide primers designed from this fragment. One
primer was synthesized to initiate 5' extension of the known fragment, and the other primer was
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target
sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would
result in hairpin structures and primer-primer dimerizations was avoided.
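
Two of the primer design criteria above, length and GC content, are simple enough to check directly. The sketch below is illustrative only: the function names and default thresholds are assumptions taken from the "about 22 to 30 nucleotides" and "about 50% or more" wording, and it does not model annealing temperature, hairpins, or primer-primer dimers, which OLIGO 4.06 or a comparable program would evaluate.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of bases in the primer that are G or C."""
    bases = primer.upper()
    return (bases.count("G") + bases.count("C")) / len(bases)


def meets_basic_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                         min_gc: float = 0.50) -> bool:
    """Check only primer length and GC content.

    The length test comes first, so an empty string is rejected before
    gc_fraction would divide by zero.
    """
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Hypothetical 24-mer with 14 G/C bases (about 58% GC).
candidate_ok = meets_basic_criteria("GCGCATATGCGCATATGCGCATGC")
```

A primer passing this check would still need the structural and melting-temperature screening described in the text.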

Selected human cDNA libraries were used to extend the sequence. If more than one
extension was necessary or desired, additional or nested sets of primers were designed.

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4,
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C,
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
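
The cycling parameters above can be written out as data to make the program structure explicit. This is a hedged sketch: the names are hypothetical, ramp times between temperatures are ignored, and "repeated 20 times" is read as 20 repeats after the first pass through Steps 2 to 4 (21 passes in total), which is an interpretation rather than something the text states.

```python
# Each step is (temperature in degrees C, hold time in seconds).
INITIAL_DENATURATION = (94, 180)                 # Step 1
CYCLE_STEPS = [(94, 15), (60, 60), (68, 120)]    # Steps 2-4
REPEATS = 20                                     # Step 5
FINAL_EXTENSION = (68, 300)                      # Step 6


def programmed_hold_seconds(initial, cycle_steps, repeats, final):
    """Total programmed hold time, excluding ramping and the 4 degree C hold.

    Assumes the cycle runs once and is then repeated `repeats` times.
    """
    per_cycle = sum(seconds for _, seconds in cycle_steps)
    return initial[1] + (1 + repeats) * per_cycle + final[1]


total_seconds = programmed_hold_seconds(
    INITIAL_DENATURATION, CYCLE_STEPS, REPEATS, FINAL_EXTENSION
)
```

For the T7 and SK+ primer pair, only the annealing entry in CYCLE_STEPS changes, from 60°C to 57°C, so the programmed hold time is the same under these assumptions.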

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by
electrophoresis on a 1% agarose gel to determine which reactions were successful in extending the
sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%)
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site
overhangs, and transfected into competent E. coli cells. Transformed cells were selected on
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in
384-well plates in LB/2x carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min;
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA
recoveries were reamplified using the same conditions as described above. Samples were diluted
with 20% dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM
BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems).

In like manner, full length polynucleotide sequences are verified using the above procedure or
are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides
designed for such extension, and an appropriate genomic library.
IX. Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NO:31-60 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base
pairs, is specifically described, essentially the same procedure is used with larger nucleotide
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of
[γ-32P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech).
An aliquot containing 10^7 counts per minute of the labeled probe is used in a typical membrane-based
hybridization analysis of human genomic DNA digested with one of the following endonucleases:
Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler, supra),
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the
aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999),
supra). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers.
Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding
procedures. A typical array may be produced using available methods and machines well known to
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g.,
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645;
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.)

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be
selected using software well known in the art such as LASERGENE software (DNASTAR). The
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection.
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser
desorption and mass spectrometry may be used for detection of hybridization. The degree of
complementarity and the relative abundance of each polynucleotide which hybridizes to an element
on the microarray may be assessed. In one embodiment, microarray preparation and usage is
described in detail below.
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)+ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)+ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/µl oligo-(dT) primer (21mer), 1X
first strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40
µM dCTP, 40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse
transcription reaction is performed in a 25 µl volume containing 200 ng poly(A)+ RNA with
GEMBRIGHT kits (Incyte). Specific control poly(A)+ RNAs are synthesized by in vitro transcription
from non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction sample (one
with Cy3 and another with Cy5 labeling) is treated with 2.5 µl of 0.5M sodium hydroxide and
incubated for 20 minutes at 85°C to stop the reaction and degrade the RNA. Samples are purified
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc.
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated
using 1 µl of glycogen (1 mg/ml), 60 µl sodium acetate, and 300 µl of 100% ethanol. The sample is
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and
resuspended in 14 µl 5X SSC/0.2% SDS.
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5
µg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia
Biotech).

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water,
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a
110°C oven.

Array elements are applied to the coated glass substrate using a procedure described in US
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an average
concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed robotic
apparatus. The apparatus then deposits about 5 nl of array element sample per slide.
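The printing quantities above fix both the DNA mass per spot and the number of slides one capillary load can cover. The sketch below works through that arithmetic, assuming the load is 1 µl at 100 ng/µl, that exactly 5 nl is deposited per slide, and that no sample is lost in the capillary; the helper names are illustrative.

```python
def mass_per_spot_ng(conc_ng_per_ul: float, spot_nl: float) -> float:
    """DNA mass deposited in one spot (1 ul = 1000 nl)."""
    return conc_ng_per_ul * spot_nl / 1000.0


def slides_per_load(load_ul: float, spot_nl: float) -> int:
    """Whole number of slides one capillary load can print, ignoring dead volume."""
    return int(load_ul * 1000.0 // spot_nl)


spot_mass = mass_per_spot_ng(conc_ng_per_ul=100.0, spot_nl=5.0)
n_slides = slides_per_load(load_ul=1.0, spot_nl=5.0)
print(f"about {spot_mass} ng of DNA per spot, up to {n_slides} slides per load")
```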

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 
0.2% SDS and distilled water as before.
Hybridization 

Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered
with a 1.8 cm² coverslip. The arrays are transferred to a waterproof chamber having a cavity just
slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the
addition of 140 µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is
incubated for about 6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash
buffer (1X SSC, 0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X
SSC), and dried.
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide
containing the array is placed on a computer-controlled X-Y stage on the microscope and
raster-scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a
resolution of 20 micrometers. 
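The array size and scan resolution determine how many intensity samples one scan produces. A minimal sketch, assuming that the 20 micrometer resolution corresponds to one sample (pixel) every 20 micrometers along both axes:

```python
def pixels_per_side(side_cm: float, resolution_um: float) -> int:
    """Number of samples along one edge of the array (1 cm = 10,000 um)."""
    return round(side_cm * 10_000 / resolution_um)


side = pixels_per_side(side_cm=1.8, resolution_um=20.0)
total = side * side
print(f"{side} x {side} pixels, {total} samples per scan")
```

At one scan per fluorophore, a two-color experiment would yield twice that number of samples per array.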

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially.
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477,
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source,
although the apparatus is capable of recording the spectra from both fluorophores simultaneously.

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the 
two fluorophores and adding identical amounts of each to the hybridization mixture. 
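Because identical amounts of the calibrating cDNA are labeled with each fluorophore, the two signals at the control location should be equal after calibration. One simple way to use this, sketched below, is to scale the Cy5 channel by the ratio of the control signals before computing expression ratios. The single global scale factor and the function names are assumptions made for illustration; the text does not specify the calibration arithmetic.

```python
def balance_factor(control_cy3: float, control_cy5: float) -> float:
    """Scale factor that makes the Cy5 control signal equal the Cy3 control signal."""
    if control_cy3 <= 0 or control_cy5 <= 0:
        raise ValueError("control signals must be positive")
    return control_cy3 / control_cy5


def balanced_ratio(cy3: float, cy5: float, factor: float) -> float:
    """Cy5/Cy3 ratio for one array element after channel balancing."""
    if cy3 <= 0:
        raise ValueError("Cy3 signal must be positive")
    return (cy5 * factor) / cy3


# Hypothetical readings: the control spot reads 2000 (Cy3) and 1600 (Cy5).
factor = balance_factor(control_cy3=2000.0, control_cy5=1600.0)
# A hypothetical array element reading 1000 (Cy3) and 1600 (Cy5).
ratio = balanced_ratio(cy3=1000.0, cy5=1600.0, factor=factor)
print(f"balance factor {factor}, balanced Cy5/Cy3 ratio {ratio}")
```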

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC
computer. The digitized data are displayed as an image where the signal intensity is mapped using a
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data are also analyzed quantitatively. Where two different fluorophores are excited and
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping
emission spectra) between the fluorophores using each fluorophore's emission spectrum.
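The two processing steps in this paragraph can be sketched as follows. The first function maps a 12-bit reading linearly onto 20 color bins. The second removes crosstalk under a simple two-channel linear mixing model; that model and its leak coefficients `a35` and `a53` are assumptions introduced here for illustration, since the text says only that each fluorophore's emission spectrum is used.

```python
ADC_BITS = 12
ADC_LEVELS = 2 ** ADC_BITS   # 4096 possible readings, 0..4095
N_COLORS = 20                # linear 20-color pseudocolor scale


def color_index(adc_value: int) -> int:
    """Map a 12-bit reading to a color bin: 0 = blue (low) ... 19 = red (high)."""
    if not 0 <= adc_value < ADC_LEVELS:
        raise ValueError("reading outside the 12-bit range")
    return adc_value * N_COLORS // ADC_LEVELS


def correct_crosstalk(m3: float, m5: float, a35: float, a53: float):
    """Recover true Cy3/Cy5 signals from measured ones.

    Assumed model (hypothetical coefficients):
        m3 = t3 + a53 * t5   (fraction a53 of Cy5 emission seen in the Cy3 channel)
        m5 = t5 + a35 * t3   (fraction a35 of Cy3 emission seen in the Cy5 channel)
    """
    det = 1.0 - a35 * a53
    t3 = (m3 - a53 * m5) / det
    t5 = (m5 - a35 * m3) / det
    return t3, t5


print(color_index(0), color_index(2048), color_index(4095))
print(correct_crosstalk(m3=1010.0, m5=600.0, a35=0.10, a53=0.02))
```

In practice the leak coefficients would be measured from single-fluorophore control samples rather than chosen by hand.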

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The
software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte).
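The grid-based integration described above can be illustrated with a short sketch. It assumes a regular grid of square cells and an image held as a list of pixel rows; it is not the GEMTOOLS implementation, whose internals are not described in the text.

```python
def spot_means(image, cell):
    """Average pixel intensity inside each cell of a regular grid.

    image: list of equal-length rows of pixel intensities
    cell:  edge length of one square grid element, in pixels
    Returns a 2-D list with one mean per grid element.
    """
    n_rows = len(image) // cell
    n_cols = len(image[0]) // cell
    means = []
    for r in range(n_rows):
        row_means = []
        for c in range(n_cols):
            total = sum(
                image[y][x]
                for y in range(r * cell, (r + 1) * cell)
                for x in range(c * cell, (c + 1) * cell)
            )
            row_means.append(total / (cell * cell))
        means.append(row_means)
    return means


# Hypothetical 4 x 4 image holding four 2 x 2 spots.
demo = [
    [1, 1, 2, 2],
    [1, 1, 2, 2],
    [3, 3, 4, 4],
    [3, 3, 4, 4],
]
print(spot_means(demo, cell=2))
```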
XI. Complementary Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used
to detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TRICH. To
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence
and used to prevent promoter binding to the coding sequence. To inhibit translation, a
complementary oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding
transcript.
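The core of a complementary oligonucleotide is the reverse complement of the chosen target window. The sketch below shows only that step, for windows of 15 to 30 bases; the input sequence is a made-up placeholder rather than a TRICH sequence, and the sketch does not reproduce the uniqueness or melting-temperature checks that design software such as OLIGO performs.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(coding_5_to_3: str, start: int, length: int) -> str:
    """Reverse complement (written 5'->3') of a window of the coding strand."""
    if not 15 <= length <= 30:
        raise ValueError("this sketch covers oligonucleotides of 15 to 30 bases")
    target = coding_5_to_3[start:start + length].upper()
    if len(target) != length:
        raise ValueError("window runs past the end of the sequence")
    if set(target) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return target.translate(COMPLEMENT)[::-1]


# Hypothetical 18-base stretch standing in for the 5' end of a coding sequence.
placeholder = "ATGGCCAAGCTTGGATCC"
print(antisense_oligo(placeholder, start=0, length=15))
```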

XII. Expression of TRICH

Expression and purification of TRICH are achieved using bacterial or virus-based expression
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3).
Antibiotic resistant bacteria express TRICH upon induction with isopropyl
beta-D-thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting
insect or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is
replaced with cDNA encoding TRICH by either homologous recombination or bacterial-mediated
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases.
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K.
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther.
7:1937-1945.)

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a
26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion proteins on
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from
TRICH at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995,
supra, ch. 10 and 16). Purified TRICH obtained by these methods can be used directly in the assays
shown in Examples XVI, XVII, and XVIII, where applicable.
XIII. Functional Assays

TRICH function is assessed by expressing the sequences encoding TRICH at physiologically
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice
include PCMV SPORT (Life Technologies) and PCR3.1 (Invitrogen, Carlsbad CA), both of which
contain the cytomegalovirus promoter. 5-10 µg of recombinant vector are transiently transfected into
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome
formulations or electroporation. 1-2 µg of an additional plasmid containing sequences encoding a
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP;
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser
optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of
fluorescent molecules that diagnose events preceding or coincident with cell death. These events
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide;
changes in cell size and granularity as measured by forward light scatter and 90 degree side light
scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake;
alterations in expression of cell surface and intracellular proteins as measured by reactivity with
specific antibodies; and alterations in plasma membrane composition as measured by the binding of
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are
discussed in Ormerod, M.G. (1994) Flow Cytometry, Oxford, New York NY.

The influence of TRICH on gene expression can be assessed using highly purified
populations of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP.
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake
Success NY). mRNA can be purified from the cells using methods well known by those of skill in
the art. Expression of mRNA encoding TRICH and other genes of interest can be analyzed by
northern analysis or microarray techniques.

XIV. Production of TRICH Specific Antibodies

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to
immunize rabbits and to produce antibodies using standard protocols.

Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH
(Sigma-Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to
increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for
antipeptide and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate,
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat
anti-rabbit IgG.

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies 

Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity
chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions.

Media containing TRICH are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such
as urea or thiocyanate ion), and TRICH is collected.

XVI. Identification of Molecules Which Interact with TRICH 

Molecules which interact with TRICH may include transporter substrates, agonists or
antagonists, modulatory proteins such as Gβγ proteins (Reimann, supra) or proteins involved in
TRICH localization or clustering such as MAGUKs (Craven, supra). TRICH, or biologically active
fragments thereof, are labeled with 125I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M.
Hunter (1973) Biochem J. 133:529-539.) Candidate molecules previously arrayed in the wells of a
multi-well plate are incubated with the labeled TRICH, washed, and any wells with labeled TRICH
complex are assayed. Data obtained using different concentrations of TRICH are used to calculate
values for the number, affinity, and association of TRICH with the candidate molecules.
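The text does not state how the number and affinity of binding sites are calculated from the concentration series. One conventional option, assumed here purely for illustration, is a Scatchard analysis for a single class of independent sites, in which bound/free is linear in bound with slope -1/Kd and x-intercept Bmax. The data below are synthetic values generated from Kd = 2 and Bmax = 10 in arbitrary units, not measurements.

```python
def scatchard(bound, free):
    """Least-squares Scatchard fit: bound/free = (Bmax - bound) / Kd.

    Returns (Kd, Bmax). Assumes one class of independent binding sites.
    """
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = -intercept / slope
    return kd, bmax


# Synthetic binding isotherm: bound = Bmax * free / (Kd + free), Kd = 2, Bmax = 10.
free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
kd_fit, bmax_fit = scatchard(bound_conc, free_conc)
print(f"Kd ~ {kd_fit:.3f}, Bmax ~ {bmax_fit:.3f}")
```

With real, noisy data a direct nonlinear fit of the binding isotherm is often preferred, because the Scatchard transform distorts the error structure.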

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion
protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute a
transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems are
commercially available, and methods for use of the yeast 2-hybrid system with ion channel proteins
are discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122).

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT)
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S.
Patent No. 6,057,101).

Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH
ion channel activity using the assays described in section XVIII.
XVII. Demonstration of TRICH Activity

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion
conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are
commercially available, and the techniques to introduce them into cells are well known to those
skilled in the art. A second plasmid which expresses any one of a number of marker genes, such as
β-galactosidase, is co-transformed into the cells to allow rapid identification of those cells which have
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after
transformation under conditions appropriate for the cell line to allow expression and accumulation of
TRICH and β-galactosidase.

Transformed cells expressing β-galactosidase are stained blue when a suitable colorimetric
substrate is added to the culture media under conditions that are well known in the art. Stained cells
are tested for differences in membrane conductance by electrophysiological techniques that are well
known in the art. Untransformed cells, and/or cells transformed with either vector sequences alone or
β-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing TRICH
will have higher anion or cation conductance relative to control cells. The contribution of TRICH to
conductance can be confirmed by incubating the cells using antibodies specific for TRICH. The
antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in the ion channel,
and the associated conductance.

Alternatively, ion channel activity of TRICH is measured as current flow across a
TRICH-containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi et
al., supra; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an
appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into
mature stage IV oocytes. Injected oocytes are incubated at 18°C for 1-5 days. Inside-out
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KCl,
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations
of the TRICH mediator, such as cAMP, cGMP, or Ca2+ (in the form of CaCl2), where appropriate.
Electrode resistance is set at 2-5 MΩ and electrodes are filled with the intracellular solution lacking
mediator. Experiments are performed at room temperature from a holding potential of 0 mV.
Voltage ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current
measured is proportional to the activity of TRICH in the assay.
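The ramp parameters above determine the ramp speed and the number of samples acquired per sweep. A minimal sketch, assuming a strictly linear ramp sampled at evenly spaced times starting at t = 0; the function name is illustrative.

```python
def voltage_ramp(v_start_mv=-100.0, v_end_mv=100.0, duration_s=2.5, rate_hz=500.0):
    """Return (times_s, command_mv) for a linear ramp sampled at rate_hz."""
    n_samples = round(duration_s * rate_hz)
    slope_mv_per_s = (v_end_mv - v_start_mv) / duration_s
    times = [i / rate_hz for i in range(n_samples)]
    volts = [v_start_mv + slope_mv_per_s * t for t in times]
    return times, volts


times, volts = voltage_ramp()
slope = (volts[1] - volts[0]) / (times[1] - times[0])
print(f"{len(volts)} samples per sweep, ramp speed about {slope:.0f} mV/s")
```

Each sample pairs one command voltage with one current reading, so a single sweep yields a full current-voltage relation from -100 mV up to just under 100 mV.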

In particular, the activity of TRICH-20 is measured as Ca2+ conductance, the activity of
TRICH-22 is measured as Cl- conductance in the presence of glycine, the activity of TRICH-23 is
measured as Ca2+ conductance, the activity of TRICH-24 is measured as K+ conductance in the
presence of Ca2+, and the activity of TRICH-26 is measured as cation conductance in the presence of
heat.

Transport activity of TRICH is assayed by measuring uptake of labeled substrates
(including but not limited to, maltose, glucose, or glycogen) into Xenopus laevis oocytes. Oocytes at
stages V and VI are injected with TRICH mRNA (10 ng per oocyte) and incubated for 3 days at 18°C
in OR2 medium (82.5 mM NaCl, 2.5 mM KCl, 1 mM CaCl2, 1 mM MgCl2, 1 mM Na2HPO4, 5 mM
Hepes, 3.8 mM NaOH, 50 µg/ml gentamycin, pH 7.8) to allow expression of TRICH. Oocytes are
then transferred to standard uptake medium (100 mM NaCl, 2 mM KCl, 1 mM CaCl2, 1 mM MgCl2, 10
mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., amino acids, sugars, drugs, ions, and
neurotransmitters) is initiated by adding labeled substrate (e.g. radiolabeled with 3H, fluorescently
labeled with rhodamine, etc.) to the oocytes. After incubating for 30 minutes, uptake is terminated by
washing the oocytes three times in Na+-free medium; the incorporated label is then measured and
compared with controls. TRICH activity is proportional to the level of internalized labeled substrate.
In particular, test substrates include sulfate for TRICH-13, tricarboxylates for TRICH-21,
dicarboxylates and Na+ for TRICH-25, ornithine for TRICH-27, and monocarboxylates for TRICH-28.

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled
ATP-[γ-32P], separation of the hydrolysis products by chromatographic methods, and quantitation of the
recovered 32P using a scintillation counter. The reaction mixture contains ATP-[γ-32P] and varying
amounts of TRICH in a suitable buffer incubated at 37°C for a suitable period of time. The reaction
is terminated by acid precipitation with trichloroacetic acid and then neutralized with base, and an
aliquot of the reaction mixture is subjected to membrane or filter paper-based chromatography to
separate the reaction products. The amount of 32P liberated is counted in a scintillation counter. The
amount of radioactivity recovered is proportional to the ATPase activity of TRICH in the assay.
XVIII. Identification of TRICH Agonists and Antagonists

TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK
(Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in the
presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using
patch clamp methods well known in the art or as described in Example XVII. Alternatively, ion
channel activity is assayed using fluorescent techniques that measure ion flux across the cell
membrane (Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and G.R. Molloy
(1996) Anal. Biochem. 241:51-58). These assays may be adapted for high-throughput screening
using microplates. Changes in internal ion concentration are measured using fluorescent dyes such as
the Ca2+ indicator Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the Cl-
indicator MQAE (all available from Molecular Probes) in combination with the FLIPR fluorimetric
plate reading system (Molecular Devices). In a more generic version of this assay, changes in
membrane potential caused by ionic flux across the plasma membrane are measured using oxonol
dyes such as DiBAC4 (Molecular Probes). DiBAC4 equilibrates between the extracellular solution
and cellular sites according to the cellular membrane potential. The dye's fluorescence intensity is
20-fold greater when bound to hydrophobic intracellular sites, allowing detection of DiBAC4 entry
into the cell (Gonzalez, J.E. and P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631).
Candidate agonists or antagonists may be selected from known ion channel agonists or antagonists,
peptide libraries, or combinatorial chemical libraries.

Various modifications and variations of the described methods and systems of the invention
will be apparent to those skilled in the art without departing from the scope and spirit of the
invention. Although the invention has been described in connection with certain embodiments, it
should be understood that the invention as claimed should not be unduly limited to such specific
embodiments. Indeed, various modifications of the described modes for carrying out the invention
which are obvious to those skilled in molecular biology or related fields are intended to be within the
scope of the following claims.

What is claimed is:

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO:1-30,

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical
to an amino acid sequence selected from the group consisting of SEQ ID NO:1-30,

c) a biologically active fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-30, and

d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from
the group consisting of SEQ ID NO:1-30.

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:1-30.

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID
NO:31-60.

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a
polynucleotide of claim 3.

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising:

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of
claim 1, and

b) recovering the polypeptide so expressed.

10. An isolated antibody which specifically binds to a polypeptide of claim 1.

11. An isolated polynucleotide selected from the group consisting of:

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting
of SEQ ID NO:31-60,

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:31-60,

c) a polynucleotide complementary to a polynucleotide of a),

d) a polynucleotide complementary to a polynucleotide of b), and

e) an RNA equivalent of a)-d).

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a
polynucleotide of claim 11.

13. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides
comprising a sequence complementary to said target polynucleotide in the sample, and which probe
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization
complex is formed between said probe and said target polynucleotide or fragments thereof, and

b) detecting the presence or absence of said hybridization complex, and, optionally, if
present, the amount thereof.

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method of detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 11, the method comprising:

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction
amplification, and

b) detecting the presence or absence of said amplified target polynucleotide or fragment
thereof, and, optionally, if present, the amount thereof.

16. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable
excipient.

17. A composition of claim 16, wherein the polypeptide has an amino acid sequence selected
from the group consisting of SEQ ID NO:1-30.

18. A method for treating a disease or condition associated with decreased expression of
functional TRICH, comprising administering to a patient in need of such treatment the composition
of claim 16.

19. A method of screening a compound for effectiveness as an agonist of a polypeptide of
claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting agonist activity in the sample.

20. A composition comprising an agonist compound identified by a method of claim 19 and
a pharmaceutically acceptable excipient.

21. A method for treating a disease or condition associated with decreased expression of
functional TRICH, comprising administering to a patient in need of such treatment a composition of
claim 20.

22. A method of screening a compound for effectiveness as an antagonist of a polypeptide of
claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting antagonist activity in the sample.

23. A composition comprising an antagonist compound identified by a method of claim 22
and a pharmaceutically acceptable excipient.

24. A method for treating a disease or condition associated with overexpression of functional
TRICH, comprising administering to a patient in need of such treatment a composition of claim 23.

25. A method of screening for a compound that specifically binds to the polypeptide of claim
1, the method comprising:

a) combining the polypeptide of claim 1 with at least one test compound under suitable
conditions, and

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a
compound that specifically binds to the polypeptide of claim 1.

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions
permissive for the activity of the polypeptide of claim 1,

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound,
and

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a
compound that modulates the activity of the polypeptide of claim 1.



27. A method of screening a compound for effectiveness in altering expression of a target
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method
comprising:

a) exposing a sample comprising the target polynucleotide to a compound, under conditions
suitable for the expression of the target polynucleotide,

b) detecting altered expression of the target polynucleotide, and

c) comparing the expression of the target polynucleotide in the presence of varying amounts
of the compound and in the absence of the compound.



28. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at
least 20 contiguous nucleotides of a polynucleotide of claim 11 under conditions whereby a specific
hybridization complex is formed between said probe and a target polynucleotide in the biological
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim
11 or fragment thereof,

c) quantifying the amount of hybridization complex, and

d) comparing the amount of hybridization complex in the treated biological sample with the
amount of hybridization complex in an untreated biological sample, wherein a difference in the
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test
compound.

29. A diagnostic test for a condition or disease associated with the expression of TRICH in a 

biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 10, under conditions suitable 
for the antibody to bind the polypeptide and form an antibody polypeptide complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the presence 
of the polypeptide in the biological sample. 

30. The antibody of claim 10, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 
c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

31. A composition comprising an antibody of claim 10 and an acceptable excipient. 

32. A method of diagnosing a condition or disease associated with the expression of TRICH 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
31. 

33. A composition of claim 31, wherein the antibody is labeled. 

34. A method of diagnosing a condition or disease associated with the expression of TRICH 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
33. 


35. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
10, the method comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-30, or an immunogenic fragment thereof, under conditions to 
elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which binds specifically to a polypeptide having an amino acid sequence selected from the 

group consisting of SEQ ID NO: 1-30. 

36. An antibody produced by a method of claim 35. 

37. A composition comprising the antibody of claim 36 and a suitable carrier. 


38. A method of making a monoclonal antibody with the specificity of the antibody of claim 
10, the method comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-30, or an immunogenic fragment thereof, under conditions to 

elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal antibody- 
producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which binds specifically to a polypeptide 

having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30. 

39. A monoclonal antibody produced by a method of claim 38. 

40. A composition comprising the antibody of claim 39 and a suitable carrier. 

41. The antibody of claim 10, wherein the antibody is produced by screening a Fab 
expression library. 

42. The antibody of claim 10, wherein the antibody is produced by screening a recombinant 

immunoglobulin library. 

43. A method of detecting a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-30 in a sample, the method comprising: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30 in 

the sample. 

44. A method of purifying a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-30 from a sample, the method comprising: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-30. 



45. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:1. 

46. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

47. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

48. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

49. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

50. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

51. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

52. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

53. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

54. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10. 

55. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:11. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

75. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:31. 

76. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:32. 

77. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:33. 

78. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:34. 

79. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:35. 

80. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:36. 

81. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:37. 

82. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:38. 

83. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:39. 

84. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:40. 

85. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:41. 

86. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:42. 

87. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:43. 

88. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:44. 

89. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:45. 

90. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:46. 

91. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:47. 

92. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:48. 

93. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:49. 

94. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:50. 

95. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:51. 

96. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:52. 

97. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:53. 

98. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:54. 

99. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:55. 

100. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:56. 

101. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:57. 

102. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:58. 

103. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:59. 

104. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID NO:60. 

<110> INCYTE GENOMICS, INC. 
YUE, Henry 
THORNTON, Michael 
RAMKUMAR, Jayalaxmi 
TANG, Y. Tom 
AZIMZAI, Yalda 
BAUGHN, Marian R. 
YANG, Junming 
YAO, Monique G. 
LAL, Preeti 
WALIA, Narinder K. 
GANDHI, Ameena R. 
HAFALIA, April J. A. 
NGUYEN, Danniel B. 
PATTERSON, Chandra 
ELLIOTT, Vicki S. 
TRIBOULEY, Catherine M. 
LU, Dyung Aina M. 
XU, Yuming 
REDDY, Roopa 
HERNANDEZ, Roberto 
BOROWSKY, Mark L. 
LO, Terence P. 
LU, Yan 

POLICKY, Jennifer L. 
GREENE, Barrie D. 
SANJANWALA, Madhu S. 
RAUMANN, Brigette E. 
BURFORD, Neil 
ISON, Craig H. 
LEE, Ernestine A. 
DING, Li 
DAS, Debopriya 
KALLICK, Deborah A. 
KHAN, Farrah A. 
SEILHAMER, Jeffrey J. 

<120> TRANSPORTERS AND ION CHANNELS 

<130> PI-0183 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/223,269; 60/224,456; 60/226,410; 60/228,140; 60/230,067; 
60/231,434 

<151> 2000-08-03; 2000-08-10; 2000-08-18; 2000-08-25; 2000-08-31; 
2000-09-08 

<160> 60 

<170> PERL Program 

<210> 1 

<211> 308 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2194064CD1 

<400> 1 

Met Val Gly Gly Val Leu Ala Ser Leu Gly Phe Val Phe Ser Ala 
  1               5                  10                  15 

Phe Ala Ser Asp Leu Leu His Leu Tyr Leu Gly Leu Gly Leu Leu 
                 20                  25                  30 

Ala Gly Phe Gly Trp Ala Leu Val Phe Ala Pro Ala Leu Gly Thr 
                 35                  40                  45 

Leu Ser Arg Tyr Phe Ser Arg Arg Arg Val Leu Ala Val Gly Leu 
                 50                  55                  60 

Ala Leu Thr Gly Asn Gly Ala Ser Ser Leu Leu Leu Ala Pro Ala 
                 65                  70                  75 

Leu Gln Leu Leu Leu Asp Thr Phe Gly Trp Arg Gly Ala Leu Leu 
                 80                  85                  90 

Leu Leu Gly Ala Ile Thr Leu His Leu Thr Pro Cys Gly Ala Leu 
                 95                 100                 105 

Leu Leu Pro Leu Val Leu Pro Gly Asp Pro Pro Ala Pro Pro Arg 
                110                 115                 120 

Ser Pro Leu Ala Ala Leu Gly Leu Ser Leu Phe Thr Arg Arg Ala 
                125                 130                 135 

Phe Ser Ile Phe Ala Leu Gly Thr Ala Leu Val Gly Gly Gly Tyr 
                140                 145                 150 

Phe Val Pro Tyr Val His Leu Ala Pro His Ala Leu Asp Arg Gly 
                155                 160                 165 

Leu Gly Gly Tyr Gly Ala Ala Leu Val Val Ala Val Ala Ala Met 
                170                 175                 180 

Gly Asp Ala Gly Ala Arg Leu Val Cys Gly Trp Leu Ala Asp Gln 
                185                 190                 195 

Gly Trp Val Pro Leu Pro Arg Leu Leu Ala Val Phe Gly Ala Leu 
                200                 205                 210 

Thr Gly Leu Gly Leu Trp Val Val Gly Leu Val Pro Val Val Gly 
                215                 220                 225 

Gly Glu Glu Ser Trp Gly Gly Pro Leu Leu Ala Ala Ala Val Ala 
                230                 235                 240 

Tyr Gly Leu Ser Ala Gly Ser Tyr Ala Pro Leu Val Phe Gly Val 
                245                 250                 255 

Leu Pro Gly Leu Val Gly Val Gly Gly Val Val Gln Ala Thr Gly 
                260                 265                 270 

Leu Val Met Met Leu Met Ser Leu Gly Gly Leu Leu Gly Pro Pro 
                275                 280                 285 

Leu Ser Gly Lys Asp Leu Ser Ser Gln Ile Cys Leu Gln Leu Ser 
                290                 295                 300 

Ser Ala Pro Gly Val Arg Gly Phe 
                305 











<210> 2 

<211> 606 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2744094CD1 

<400> 2 



Met 


Ala 


Glu 


Gin 


Leu 


Ser 


Gin Gin 


Leu 


Pro 


Arg Thr Cys Leu 


Trp 


1 








5 










10 




15 


His 


Leu 


Tyr 


He 


Thr 


Thr 


Val 


Ser 


Leu 


Pro 


Gly Tyr Met Val 


Ser 










20 










25 




30 


Cys 


He 


He 


Phe 


Phe 


Phe 


Val 


Val 


Pro 


He 


Val Phe Leu Thr 


He 










35 










40 




45 


Phe 


Ser 


Phe 


Trp 


Trp 


Leu 


Ser 


Tyr 


Trp 


Leu 


Glu Gin Gly Ser 


Gly 










50 










55 




60 


Thr Asn 


Ser 


Ser 


Arg 


Glu 


Ser 


Asn 


Gly 


Thr 


Met Ala Asp Leu 


Gly 










65 










70 




75 


Asn 


He 


Ala 


Asp 


Asn 


Pro 


Gin 


Leu 


Ser 


Phe 


Tyr Gin Leu Val 


Tyr 
80 










85 










90 
Leu 


Asn 


Ala 


Leu 
95 


Leu 


Leu 


He 


Cys 


Val 
100 


Gly 


Val 


Cys 


Ser 


Ser 
105 


Gly 


He 


Phe 


Thr 


Lys 
110 


Val 


Thr 


Arg 


Lys 


Ala 
115 


Ser 


Thr 


Ala 


Leu 


His 
120 


Asn 


Lys 


Leu 


Phe 


Asn 
125 


Lys 


Val 


Phe 


Arg 


Cys 
130 


Pro 


Met 


Ser 


Phe 


Phe 
135 


Asp 


Thr 


He 


Pro 


He 
140 


Gly 


Arg 


Leu 


Leu 


Asn 
145 


Cys 


Phe 


Ala 


Gly 


Asp 
150 


Leu 


Glu 


Gin 


Leu 


Asp 
155 


Gin 


Leu 


Leu 


Pro 


He 
160 


Phe 


Ser 


Glu 


Gin 


Phe 
165 


Leu 


Val 


Leu 


Ser 


Leu 
170 


Met 


Val 


He 


Ala 


Val 
175 


Leu 


Leu 


He 


Val 


Ser 
180 


Val 


Leu 


Ser 


Pro 


Tyr 
185 


He 


Leu 


Leu 


Met 


Gly 
190 


Ala 


He 


He 


Met 


Val 
195 


He 


Cys 


Phe 


He 


Tyr 
200 


Tyr 


Met 


Met 


Phe 


Lys 
205 


Lys 


Ala 


He 


Gly 


Val 
210 


Phe 


Lys 


Arg 


Leu 


Glu 
215 


Asn 


Tyr 


Ser 


Arg 


Ser 
220 


Pro 


Leu 


Phe 


Ser 


His 
225 


He 


Leu 


Asn 


Ser 


Leu 
230 


Gin 


Gly 


Leu 


Ser 


Ser 
235 


He 


His 


Val 


Tyr 


Gly 
240 


Lys 


Thr 


Glu 


Asp 


Phe 
245 


He 


Ser 


Gin 


Phe 


Lys 
250 


Arg 


Leu 


Thr 


Asp 


Ala 

255 


Gin 


Asn 


Asn 


Tyr 


Leu 
260 


Leu 


Leu 


Phe 


Leu 


Ser 
265 


Ser 


Thr 


Arg 


Trp 


Met 
270 


Ala 


Leu 


Arg 


Deu 


Glu 
275 


He 


Met 


Thr 


Asn 


Leu 
280 


Val 


Thr 


Leu 


Ala 


Val 
285 


Ala 


Leu 


Phe 


Val 


Ala 
290 


Phe 


Gly 


He 


Ser 


Ser 
295 


Thr 


Pro 


Tyr 


Ser 


Phe 
300 


Lys 


Val 


Met 


Ala 


Val 
305 


Asn 


He 


Val 


Leu 


Gin 
310 


Leu 


Ala 


Ser 


Ser 


Phe 
315 


Gin 


Ala 


Thr 


Ala 


Arg 
320 


He 


Gly 


Leu 


Glu 


Thr 

325 


Glu 


Ala 


Gin 


Phe 


Thr 
330 


Ala 


Val 


Glu 


Arg 


He 
335 


Leu 


Gin 


Tyr 


Met 


Lys 
340 


Met 


Cys 


Val 


Ser 


Glu 
345 


Ala 


Pro 


Leu 


His 


Met 
350 


Glu 


Gly 


Thr 


Ser 


Cys 
355 


Pro 


Gin 


Gly 


Trp 


Pro 
360 


Gin 


His 


Gly 


Glu 


He 
365 


He 


Phe 


Gin 


Asp 


Tyr 
370 


His 


Met 


Lys 


Tyr 


Arg 
375 


Asp 


Asn 


Thr 


Pro 


Thr 
380 


Val 


Leu 


His 


Gly 


He 
385 


Asn 


Leu 


Thr 


He 


Arg 
390 


Gly 


His 


Glu 


Val 


Val 
395 


Gly 


He 


Val 


Gly 


Arg 
400 


Thr 


Gly 


Ser 


Gly 


Lys 
405 


Ser 


Ser 


Leu 


Gly 


Met 
410 


Ala 


Leu 


Phe 


Arg 


Leu 
415 


Val 


Glu 


Pro 


Met 


Ala 
420 


Gly 


Arg 


He 


Leu 


He 
425 


Asp 


Gly 


Val 


Asp 


He 
430 


Cys 


Ser 


He 


Gly 


Leu 
435 


Glu 


Asp 


Leu 


Arg 


Ser 
440 


Lys 


Leu 


Ser 


Val 


He 
445 


Pro 


Gin 


Asp 


Pro 


Val 
450 


Leu 


Leu 


Ser 


Gly 


Thr 
455 


He 


Arg 


Phe 


Asn 


Leu 
460 


Asp 


Pro 


Phe 


Asp 


Arg 
465 


His 


Thr 


Asp 


Gin 


Gin 
470 


He 


Trp 


Asp 


Ala 


Leu 
475 


Glu 


Arg 


Thr 


Phe 


Leu 
480 


Thr 


Lys 


Ala 


He 


Ser 
485 


Lys 


Phe 


Pro 


Lys 


Lys 
490 


Leu 


His 


Thr 


Asp 


Val 
495 


Val 


Glu 


Asn 


Gly 


Gly 
500 


Tyr 


Phe 


Ser 


Val 


Gly 
505 


Glu 


Arg 


Gin 


Leu 


Leu 
510 


Cys 


He 


Ala 


Arg 


Ala 
515 


Val 


Leu 


Arg 


Asn 


Ser 
520 


Lys 


He 


He 


Leu 


He 
525 


Asp 


Glu 


Ala 


Thr 


Ala 
530 


Ser 


He 


Asp 


Met 


Glu 
535 


Thr 


Asp 


Thr 


Leu 


He 
540 


Gin 


Arg 


Thr 


He 


Arg 
545 


Glu 


Ala 


Phe 


Gin 


Gly 
550 


Cys 


Thr 


Val 


Leu 


Val 
555 
Ile Ala His Arg Val Thr Thr Val 
560 

Val Met Gly Asn Gly Lys Val Val 
575 

Leu Arg Lys Lys Pro Gly Ser Leu 
590 

Ala Thr Ser Ser Leu Arg 
605 
jj 
Leu 


Phe 


Leu 
Ile 
cn 
Met 
Vd.1 


Pro 
Leu 


Asn 


Pro 


Asn 


Leu 


He 


Leu 


Gly 


Tyr 


Thr 


Pro 


J. ic 
Gln 


Lys 


Val 


Ser 


Thr 


Asp 


Thr 


Glu 


Glu 


Tyr 


Thr 
110 


Asn 


Glu 


Lys 


Dei 


uyss 


Pro 


Ser 


Asn 
125 


Phe 


Val 


Gly 


Ser 


Tyr 


Glu 


Leu 


Arg 
140 


Phe 


Phe 


Pro 


He 


Tyr 


Met 


Asp 


Ser 
155 


Arg 


Ala 


Gly 


Ala 


Gin 


Tyr 


Trp 


Ser 
170 


Ser 


Gly 


Phe 


Asp 


Ala 


Ala 


He 


He 
185 


Gin 


Leu 


Lys 


Glu 


Leu 


Glu 


Ser 


Thr 
200 


Lys 


Ala 


Val 


Val 


Glu 


He 


Asp 


Thr 
215 


Phe 


Pro 


Arg 


Val 


lie 


Ala 


Phe 


Ser 
230 


Pro 


Phe 


Gly 


Val 


Ala 


Glu 


Lys 


Glu 
245 


Lys 


Lys 


He 


Gly 


Leu 


His 


Asp 


Thr 
260 


Ala 


Phe 


Trp 


Thr 


Ser 


Leu 


He 


Phe 
275 


Leu 


Met 


Ser 


Thr 


Ala 


Ser 


Leu 


Leu 
290 


Phe 


Pro 


Gin 


Leu 


Leu 


Phe 


Phe 


Leu 
305 


Tyr 


Gly 


Leu 


Met 


Leu 


Thr 


Pro 


Leu 
320 


Phe 


Lys 


Lys 



Leu Asn 


Cys Asp 


Arg 


He 


Leu 


565 








570 


Glu Phe 


Asp Arg 


Pro 


Glu 


Val 


580 








585 


Phe Ala 


Ala Leu 


Met 


Ala 


Thr 


595 








600 


Gly Val 


Trp Arg 


Gin 


Thr 


Arg 


10 








15 


He Lys 


Cys Arg 


Thr 


Lys 


Lys 


25 








30 


Pro Leu 


Phe Phe 


Leu 


Phe 


Trp 


40 








45 


Pro Asn 


Lys Lys 


Tyr 


Glu 


Glu 


55 








60 


Met Asp 


Lys Phe 


Thr 


Leu 


Ser 


70 








75 


Val Thr 


Asn He 


Thr 


Ser 


Ser 


85 








90 


His Leu 


Pro Asp 


Val 


He 


He 


100 








105 


Glu Met 


Leu Thr 


Ser 


Ser 


Leu 


115 








120 


Val Val 


Phe Lys 


Asp 


Ser 


Met 


130 








'135 


Asp Met 


He Pro 


Val 


Ser 


Ser 


145 








150 


Cys Ser 


Lys Ser 


Cys 


Glu 


Ala 


160 








165 


Thr Val 


Leu Gin 


Ala 


Ser 


He 


175 








180 


Thr Asn 


Val Ser 


Leu 


Trp 


Lys 


190 








195 


He Met 


Gly Glu 


Thr 


Ala 


Val 


205 








210 


Gly Val 


He Leu 


He 


Tyr 


Leu 


220 








225 


Tyr Phe 


Leu Ala 


lie 


His 


He 


235 








240 


Lys Glu 


Phe Leu 


Lys 


He 


Met 


250 








255 


Leu Ser 


Trp Val 


Leu 


Leu 


Tyr 


265 








270 


Leu Leu 


Met Ala 


Val 


He 


Ala 


280 








285 


Ser Ser 


Ser He 


Val 


He 


Phe 


295 








300 


Ser Ser 


Val Phe 


Phe 


Ala 


Leu 


310 








315 


Ser Lys 


His Val 


Gly 


He 


Val 


325 








330 
Glu 


Phe 


Phe 


He 


Leu 


He 


Pro 


Phe 


Cys 


His 


Leu 


Glu 


Ala 


Gly 


Pro 


Asn 


Ser 


He 


He 


Pro 


Gly 


Lys 


Pro 


Ser 


Ser 


Glu 


Gly 


Glu 


Pro 


Val 


Ser 


Gly 


lie 


Ala 


Leu 


Arg 


Ala 


Leu 


Leu 


lie 


Leu 


Cys 


Tyr 


Gly 


His 


Lys 


Met 


He 


Leu 


Thr 


Val 


He 


Pro 


Ala 


Asp 


Leu 


Asp 


Ser 


Gly 


Gly 


Gly 


Asn 


Pro 


Asp 


Pro 


Cys 


Lys 


Ala 


Asn 


Ala 


Asp 


He 


Leu 


Lys 


Cys 


He 


Gly 


Tyr 


Glu 


Ser 


Leu 


Leu 


Leu 


Gin 


Lys 


Asp 


Met 


His 


Ser 


Asn 


Leu 


Glu 


Asp 


Gin 


Ala 


Asp 



Val 


Thr 


Val 




335 




Glu 


Ser 


Phe 




350 




His 


Cys 


Thr 




365 




Asp 


Phe 


Asn 




380 




Tyr 


Pro 


Leu 




395 




Phe 


Tyr 


Val 




410 




Glu 


Phe 


Gly 




425 




Tyr 


Trp 


Ser 




440 




Asn 


Val 


Asn 




455 




Ser 


Ser 


Glu 




470 




Gin 


Lys 


Thr 




485 




Asn 


Leu 


Ser 




500 




Gly 


His 


Ser 




515 




Gly 


Leu 


Cys 




530 




Arg 


Val 


Ser 




545 




Gly 


He 


Cys 




560 




Glu 


Glu 


Asn 




575 




Asn 


Asn 


He 




590 




Met 


Gin 


Thr 




605 




Gin 


Lvs 


Arg 




620 




Lvs 


He 


Leu 




635 




Ser 


Arg 


His 




650 




Arg 


Val 


Thr 




665 




Leu 


Ala 


Asp 




680 




Val 


Gly 


Ser 




695 




Arg 


Leu 


Ser 




710 




Ser 


Ser 


Leu 




725 




Gin 


Asn 


Asp 




740 




Asp 


Lys 


Phe 




755 




Leu 


Gly 


Val 




770 




Val 


Phe 


Leu 




785 




Tyr 


Ser 


Val 



Ala Phe Gly 
Pro Lys Ser 
Phe Val He 
Glu Gly Ala 
He He Thr 
Leu Leu Ala 
Leu Arg Arg 
Lys Ser Lys 
Gly Asn He 
Phe Val Gly 
Tyr Arg Lys 
Phe Asp He 
Gly Thr Gly 
Pro Pro Ser 
Glu He Asp 
Pro Gin Leu 
Leu Ser He 
He Gin Glu 
He Lys Asp 
Lys Leu Ser 
Leu Leu Asp 
He Val Trp 
Val Phe Ser 
Arg Lys Ala 
Ser Met Phe 
Met Tyr He 
Val Lys Gin 
Gin Gin Leu 
Ser Gly Leu 
He Ser Tyr 
Lys Leu Glu 
Phe Thr Gin 



Phe He 


Gly 


340 




Leu Val 


Trp 


355 




Gly He 


Ala 


370 




Ser Phe 


Ser 


385 




He He 


Met 


400 




Val Tyr 


Leu 


415 




Ser Ser 


Leu 


430 




Arg Asn 


Tyr 


445 




Ser Phe 


Ser 


460 




Lys Glu 


Ala 


475 




Lys Gly 


Glu 


490 




Tyr Glu 


Gly 


505 




Lys Ser 


Thr 


520 




Asp Gly 


Phe 


535 




Glu Met 


Phe 


550 




Asp He 


His 


565 




Leu Ala 


Ser 


580 




Val Gin 


Lys 


595 




Asn Gin 


Ala 


610 




Leu Gly 


He 


625 




Glu Pro 


Thr 


640 




Asn Leu 


Leu 


655 




Thr His 


Phe 


670 




Val He 


Ser 


685 




Leu Lys 


Ser 


700 




Asp Lys 


Tyr 


715 




His He 


Pro 


730 




Val Tyr 


Ser 


745 




Phe Ser 


Ala 


760 




Gly Val 


Ser 


775 




Val Glu 


Ala 


790 




Gin Pro 


Leu 
Met 
345 


Leu 


Phe 


Ser 






360 


Gin 


Val 


Met 






375 


Asn 


Leu 


Thr 






390 


Leu 


Thr 


Leu 






405 


Asp 


Gin 


Val 






420 


Tyr 


Phe 


Leu 






435 


Glu 


Glu 


Leu 






450 


Glu 


lie 


He 






465 


He 


Arg 


He 






480 


Asn 


Val 


Glu 






495 


Gin 


He 


Thr 
Leu 


Met 


Asn 






525 


Ala 


Ser 


He 






540 


Glu 


Ala 


Arg 
Phe 
Val 
Ile 


Lys 


Gly 






585 


Val 


Leu 


Leu 






600 


Lys 


Lys 


Leu 
Ala 


Val 


Leu 
Ala 


Gly 


Met 
Lys 


Tyr 


Arg 
Met 


Asp 


Glu 
Gln 


Gly 


Met 
Lys 


Trp 


Gly 






705 


Cys 


Ala 


Thr 






720 


Gly 


Ala 


Thr 






735 


Leu 


Pro 


Phe 
Leu 


Asp 


Ser 
Met 


Thr 


Thr 
Glu 


He 


Asp 
800 


Met 


Asd 


Ser 


Lys Ser 








815 


Leu 


Ser 


Glu 


Thr Lys 








830 


Lys 


Gin 


Gin 


Met Tyr 








845 


Lvs 


Ara 


Glu 


Ser Lys 








860 


lie 


Phe 


Phe 


Thr Val 








875 


Phe 


Lys 


Asn 


Ala Val 








890 


Phe 


Leu 


Lys 


Pro Gly 








905 


Leu 


Leu 


Gin 


Asn Ser 








920 


Phe 


Phe 


Thr 


Ser Gin 








935 


Asp 




Val 


Ser Val 








950 


His 


Ser 


Glu 


Lvs Asd 








965 


Met 


Val 


Tvr* 


Cgr Lgu 








980 


Tvr 


Leu 


TVr 


His Leu 








995 


Thr 


Pro 


Phe 


Phe Gin 








1010 


Leu 


±yx 


Phe 


Gin Ala 








1025 


Pro 


Pro 


i.y j. 


Phe Ala 








1040 


Ala 




Thr 


Gin Leu 








1055 




lie 


Glv 


Gin Ala 








1070 


Leu 


lie 


Leu 


Met Leu 
1085 


TVr 


Phe 


Tvr 


Thr Val 








1100 


Glv 
oxy 


Ayx 


Val 


Pro Ser 








1115 


Thr 


Phe 


Lys 


Lys He 








1130 


Tvr 


Ser 


Val 


Ala Ala 








1145 


Phe 


Phe 


Met 


Glv Tvx 








1160 


Cys 


He 


He 


He Pro 








1175 


Phe 


He 


Lys 


He Ser 








1190 


Tvr 


Asn 


Pro 


Trp Asp 








1205 


Leu 


Gin 


Cys 


Val Leu 








1220 


Lys 


Tyr 


Gly 


Gly Arg 








1235 


Leu 


Ser 


Thr 


Lys Ser 








1250 


Asn 


Glu 


Asp 


Glu Asp 








1265 









805 


Phe 


Asp 


Glu 


Met Glu 








820 


Ala 


Ser 


Leu 


Val Ser 








835 


Thr 


He 


Ala 


Lys Phe 








850 


Ser 


Val 


Arg 


Ser Val 








865 


Gin 


He 


Phe 


Met Phe 








880 


Val 


Pro 


He 


Lys Leu 








895 


Asp 


Lvs 


Pro 


His Lys 








910 


Ala 


Asp 


Ser 


Asp He 








925 


Asn 


He 


Met 


Val Thr 








940 


Ala 


Pro 


His 


Ser Ala 








955 


Tvx 


Val 


Phe 


Ala Ala 








970 


Pro 


He 


Leu 


Val Asn 








985 


Asn 


Val 


Thr 


Glu Thr 








1000 


Glu 


He 


Thr 


Asp He 








1015 


Ala 


Leu 


Leu 


Gly He 








1030 


Met 


Glu 


Asn 


Ala Glu 








1045 


Lys 


Leu 


Ser 


Gly Leu 








1060 


Val 


Val 


Asp 


He Pro 








1075 


Glv 


Ser 


Leu 


Leu Ala 








1090 


Lvs 


Phe 


Leu 


Ala Val 








1105 


Val 


He 


Leu 


Phe Thr 








1120 


Leu 


Asn 


Thr 


Lys Glu 








1135 


Leu 


Ala 


Cys 


He Ala 








1150 


Thr 


He 


Ala 


Thr He 








1165 


He 


Tyr 


Pro 


Leu Leu 








1180 


Trp 


Lys 


Asn 


Val Arg 








1195 


Arg 


Leu 


Ser 


Val Ala 








1210 


Trp 


He 


Phe 


Leu Leu 








1225 


Ser 


He 


Arg 


Lys Asp 








1240 


Lys 


Asn 


Arg 


Lys Leu 








1255 


Glu 


Asp 


Val 


Lys Ala 








1270 









810 


Gin 


Ser 


Leu 


Leu He 








825 


Thr 


Met 


Ser 


Leu Tip 








840 


His 


Phe 


Phe 


Thr Leu 








855 


Leu 


Leu 


Leu 


Leu Leu 








870 


Leu 


Val 


His 


His Ser 








885 


Val 


Pro 


Asp 


Leu Tyr 








900 


Tyr 


Lys 


Thr 


Ser Leu 








915 


Ser 


Asp 


Leu 


lie Ser 








930 


Met 


He 


Asn 


Asp Ser 








945 


Ala 


Leu 


Asn 


Val Val 








960 


Val 


Phe 


Asn 


Ser Thr 








975 


He 


He 


Ser 


Asn Tyr 








990 


He 


Gin 


He 


Trp Ser 








1005 


Val 


Phe 


Lys 


He Glu 








1020 


He 


Val 


Thr 


Ala Met 








1035 


Asn 


His 


Lys 


He Lys 








1050 


Leu 


Pro 


Ser 


Ala Tyr 








1065 


Leu 


Phe 


Phe 


He He 








1080 


Phe 


His 


Tyr 


Gly Leu 








1095 


Val 


Phe 


Cys 


Leu He 








1110 


Tyr 


He 


Ala 


Ser Phe 








1125 


Phe 


Trp 


Ser 


Phe He 








1140 


He 


Thr 


Glu 


He Thr 








1155 


Leu 


His 


Tyr 


Ala Phe 








1170 


Gly 


Cys 


Leu 


He Ser 








1185 


Lys 


Asn 


Val 


Asp Thr 








1200 


Val 


He 


Ser 


Pro Tyr 








1215 


Gin 


Tyr 


Tyr 


Glu Lys 








1230 


Pro 


Phe 


Phe 


Arg Asn 








1245 


Pro 


Glu 


Pro 


Pro Asp 








1260 


Glu 


Arg 


Leu 


Lys Val 








1275 
Lys 


Glu 


Leu 


Met Gly 


Cys 


Gin 


Cys 


Cys Glu Glu 


Lys Pro Ser He 








1280 








1285 


1290 


Met 


Val 


Ser 


Asn Leu 


His 


Lys 


Glu 


Tyr Asp Asp 


Lys Lys Asp Phe 








1295 








1300 


1305 


Leu 


Leu 


Ser 


Arg Lys 


Val 


Lys 


Lys 


Val Ala Thr 


Lys Tyr He Ser 








1310 








1315 


1320 


Phe 


Cys 


Val 


Lys Lys 


Gly 


Glu 


He 


Leu Gly Leu 


Leu Gly Pro Asn 






1325 








1330 


1335 


Gly 


Ala 


Gly 


Lys Ser 


Thr 


He 


He 


Asn He Leu 


Val Gly Asp He 






1340 








1345 


1350 


Glu 


Pro 


Thr 


Ser Gly 


Gin 


Val 


Phe 


Leu Gly Asp 


Tyr Ser Ser Glu 








1355 








1360 


1365 


Thr 


Ser 


Glu 


Asp Asp 


Asp 


Ser 


Leu 


Lys Cys Met 


Gly Tyr Cys Pro 








1370 








1375 


1380 


Gin 


He 


Asn 


Pro Leu 


Trp 


Pro 


Asp 


Thr Thr Leu 


Gin Glu His Phe 








1385 








1390 


1395 


Glu 


He 


Tyr 


Gly Ala 


Val 


Lys 


Gly 


Met Ser Ala 


Ser Asp Met Lys 








1400 








1405 


1410 


Glu 


Val 


He 


Ser Arg 


He 


Thr 


His 


Ala Leu Asp 


Leu Lys Glu His 








1415 








1420 


1425 


Leu 


Gin 


Lys 


Thr Val 


Lys 


Lys 


Leu 


Pro Ala Gly 


He Lys Arg Lys 








1430 








1435 


1440 


Leu 


Cys 


Phe 


Ala Leu 


Ser 


Met 


Leu 


Gly Asn Pro 


Gin He Thr Leu 






1445 








1450 


1455 


Leu 


Asp 


Glu 


Pro Ser 


Thr 


Gly 


Met 


Asp Pro Lys 


Ala Lys Gin His 








1460 








1465 


1470 


Met 


Trp 


Arg 


Ala He 


Arg- 


Thr 


Ala 


Phe Lys Asn 


Arg Lys Arg Ala 








1475 








1480 


1485 


Ala 


Ile


Leu 


Thr Thr 


His 


Tyr 


Met 


Glu Glu Ala 


Glu Ala Val Cys 








1490 








1495 


1500 


Asp 


Arg 


Val 


Ala Ile


Met 


Val 


Ser 


Gly Gln Leu


Arg Cys Ile Gly








1505 








1510 


1515 


Thr 


Val 


Gln


His Leu 


Lys 


Ser 


Lys 


Phe Gly Lys 


Gly Tyr Phe Leu 








1520 








1525 


1530 


Glu 


Ile


Lys 


Leu Lys 


Asp 


Trp 


Ile


Glu Asn Leu 


Glu Val Asp Arg 








1535 








1540 


1545 


Leu 


Gln


Arg


Glu Ile


Gln


Tyr


Ile


Phe Pro Asn


Ala Ser Arg Gln








1550 








1555 


1560 


Glu 


Ser 


Phe 


Ser Ser 


Ile


Leu 


Ala 


Tyr Lys Ile


Pro Lys Glu Asp 








1565 








1570 


1575 


Val 


Gln


Ser


Leu Ser


Gln


Ser 


Phe 


Phe Lys Leu 


Glu Glu Ala Lys 








1580 








1585 


1590 


His 


Ala 


Phe 


Ala Ile


Glu 


Glu 


Tyr 


Ser Phe Ser 


Gln Ala Thr Leu








1595 








1600 


1605 


Glu 


Gln


Val 


Phe Val 


Glu 


Leu 


Thr 


Lys Glu Gln


Glu Glu Glu Asp 








1610 








1615 


1620 


Asn 


Ser 


Cys 


Gly Thr 


Leu 


Asn 


Ser 


Thr Leu Trp 


Trp Glu Arg Thr 








1625 








1630 


1635 


Gln


Glu 


Asp 


Arg Val


Val 


Phe 














1640 
Pro


Leu 


Glu 


Glu 


Leu 


Leu 


Pro 


Gly 


Arg 


Gly 


Ser 


Gly 


Arg 


Leu 


Gly 










20 










25 










30 


Gly 


Pro 


Arg 


Gly 


Pro 


Arg 


Thr 


Ala 


Pro 


Gly 


Ala 


Val 


Gly 


Leu 


Gly 










35 










40 










45 


Pro 


Ala 


Ala 


Ala 


Gly 


Glu 


Glu 


Ala 


Trp 


Arg 


Arg 


Gly 


Arg 


Ala 


Ala 










50 










55 










60 


Pro 


Ser 


Arg 


Asp 


Asp 


Gln


Arg 


Leu 


Arg 


Pro 


Met 


Ala 


Pro 


Gly 


Leu 










65 










70 










75 


Ser 


Glu 


Ala 


Gly 


Lys 


Leu 


Leu 


Gly 


Leu 


Glu 


Tyr 


Pro 


Glu 


Arg 


Gln










80 










85 










90 


Arg 


Leu 


Ala 


Ala 


Ala 


Val 


Gly 


Phe 


Leu 


Thr 


Met 


Ser 


Gly 


Val 


Ile








95 










100 










105 


Ser 


Met 


Ser 


Ala 


Pro 


Phe 


Phe 


Leu 


Gly 


Lys 


Ile


Ile


Asp 


Ala 


Ile










110 










115 










120 


Tyr


Thr 


Asn 


Pro 


Thr 


Val 


Asp 


Tyr 


Ser 


Asp 


Asn 


Leu 


Thr 


Arg 


Leu 








125 










130 










135 


Cys


Leu


Gly


Leu 


Ser 


Ala 


Val 


Phe 


Leu 


Cys 


Gly 


Ala 


Ala 


Ala 


Asn 








140 










145 










150 


Ala 


Ile


Arg 


Val 


Tyr 


Leu 


Met 


Gln


Thr 


Ser 


Gly 


Gln


Arg


Ile


Val 










155 










160 










165 


Asn 


Arg 


Leu 


Arg


Thr 


Ser 


Leu 


Phe 


Ser 


Ser 


Ile


Leu 


Arg 


Gln


Glu 










170 










175 










180 


Val 


Ala 


Phe 


Phe 


Asp 


Lys


Thr 


Arg 


Thr 


Gly 


Glu 


Leu 


Ile


Asn 


Arg 










185 










190 










195 


Leu 


Ser 


Ser 


Asp 


Thr 


Ala 


Leu 


Leu 


Gly 


Arg 


Ser 


Val 


Thr 


Glu 


Asn 










200 










205 










210 


Leu 


Ser 


Asp 


Gly 


Leu 


Arg 


Ala 


Gly 


Ala 


Gln


Ala 


Ser 


Val 


Gly 


Ile










215 










220 










225 


Ser 


Met 


Met 


Phe 


Phe 


Val 


Ser 


Pro 


Asn 


Leu 


Ala 


Thr 


Phe 


Val 


Leu 










230 










235 










240 


Ser 


Val 


Val 


Pro 


Pro 


Val 


Ser 


Ile


Ile


Ala


Val


Ile


Tyr 


Gly 


Arg 










245 










250 










255 


Tyr 


Leu 


Arg


Lys 


Leu 


Thr 


Lys 


Val 


Thr 


Gln


Asp 


Ser 


Leu 


Ala 


Gln








260 










265 










270 


Ala 


Thr 


Gln


Leu 


Ala 


Glu 


Glu 


Arg 


Ile


Gly 


Asn 


Val 


Arg 


Thr 


Val 










275 










280 










285 


Arg 


Ala 


Phe 


Gly


Lys


Glu 


Met 


Thr 


Glu 


Ile


Glu 


Lys 


Tyr 


Ala 


Ser 










290 










295 










300 


Lys 


Val 


Asp 


His 


Val 


Met 


Gln


Leu 


Ala 


Arg 


Lys 


Glu 


Ala 


Phe 


Ala 








305 










310 










315 


Arg


Ala 


Gly 


Phe 


Phe 


Gly 


Ala 


Thr 


Gly 


Leu 


Ser 


Gly 


Asn 


Leu 


Ile








320 










325 










330 


Val 


Leu 


Ser 


Val 


Leu 


Tyr


Lys


Gly 


Gly 


Leu 


Leu 


Met 


Gly 


Ser 


Ala 










335 










340 










345 


His 


Met 


Thr 


Val 


Gly 


Glu 


Leu 


Ser 


Ser 


Phe 


Leu 


Met 


Tyr 


Ala 


Phe 










350 










355 










360 


Trp


Val 


Gly 


Ile


Ser


Ile


Gly 


Gly 


Leu 


Ser 


Ser 


Phe 


Tyr 


Ser 


Glu 






365 










370 










375 


Leu 


Met 


Lys 


Gly 


Leu 


Gly 


Ala 


Gly 


Gly 


Arg 


Leu 


Trp 


Glu 


Leu 


Leu 










380 










385 










390 


Glu 


Arg 


Glu 


Pro 


Lys 


Leu 


Pro 


Phe 


Asn 


Glu 


Gly 


Val 


Ile


Leu 


Asn 








395 










400 










405 


Glu 


Lys 


Ser 


Phe 


Gln


Gly 


Ala 


Leu 


Glu 


Phe 


Lys 


Asn 


Val 


His 


Phe 










410 










415 










420 


Ala 


Tyr 


Pro 


Ala 


Arg 


Pro 


Glu 


Val 


Pro 


Ile


Phe


Gln


Asp 


Phe 


Ser 










425 










430 










435 


Leu 


Ser 


Ile


Pro 


Ser 


Gly 


Ser 


Val 


Thr 


Ala 


Leu 


Val 


Gly 


Pro 


Ser 










440 










445 










450 


Gly 


Ser 


Gly 


Lys 


Ser 


Thr 


Val 


Leu 


Ser 


Leu 


Leu 


Leu 


Arg 


Leu 


Tyr 




455 










460 










465 


Asp 


Pro 


Ala 


Ser 


Gly 


Thr 


Ile


Ser 


Leu 


Asp 


Gly 


His 


Asp 


Ile


Arg 








470 










475 










480 


Gln


Leu 


Asn 


Pro 


Val 


Trp 


Leu 


Arg 


Ser 


Lys 


Ile


Gly 


Thr 


Val 


Ser 
485






Gin 


Glu 


Pro 


Ile Leu
500 


Phe Ser 


Cys 


Tyr 


Gly 


Ala 


Asp Asp 
515 


Pro Ser 


Ser 


Arg 


Val 


Ala 


Glu Val 
530 


Ala Asn 


Thr 


Pro 


Gin 


Gly 


Phe Asn 
545 


Thr Val 


Val 


Ser 


Gly 


Gly 


Gln Lys
560


Gln Arg


Ile


Lys 


Asn 


Pro 


Lys Ile
575 


Leu Leu 


Leu 


Asp 


Ala 


Glu 


Asn Glu 
590 


Tyr Leu 


Val 


Met 


Asp 


Gly 


Arg Thr 
605 


Val Leu 


Val 


He 


Lys 


Asn 


Ala Asn 
620 


Met Val 


Ala 


Thr 


Glu 


Tyr 


Gly Lys 
635 


His Glu 


Glu 


He 


Tyr 


Arg 


Lys Leu 
650 


Met Asn 


Lys 
Met


Val 


Lys 


Lys 


Glu 


He 


Ser 


Val 


1 








5 








Leu 


Tyr 


Lys 


Asn 


Phe 
20 


Leu 


Lys 


Lys 


He 


Gly 


Leu 


Tyr 


Leu 
35 


Cys 


He 


Phe 


Arg 


Phe 


Pro 


Glu 


Gin 
50 


Pro 


Pro 


Lys 


Phe 


Asn 


Asp 


Ser 


Gly 
65 


Leu 


Val 


Val 


He 


Thr 


Gin 


Arg 


He 
80 


Met 


Asn 


Lys 


Lys 


Gly 


Arg 


Thr 


Val 
95 


Ile


Gly 


Thr 


He 


Glu 


Leu 


Pro 


Lys 
110 


Lys 


Tyr 


His 


Ser 


Asp 


Thr 


Phe 


Ser 
125 


Tyr 


Arg 


Leu 


He 


Pro 


Val 


He 


Lys 
140 


Glu 


His 


Ser 


Ala 


Met 


His 


Gly 


Glu 

155 


He 


Phe 


Cys 


Lys 


Gly 


Phe 


Val 


Ala 
170 


Phe 


Gin 


Ala 


Glu 


Val 


Thr 


Thr 


Asn 
185 


His 


Ser 


Val 


He 


Gly 


He 


Asn 


Met 
200 


Lys 


He 


Pro 


He 


Met 


Asn 


Glu 


Trp 


Phe 


His 


Phe 



490 495 



Ser Ile


Ala 


Glu 


Asn 


He 


Ala 


505 










510 


Val Thr 


Ala 


Glu 


Glu 


He 


Gin 


520 










525 


Val Ala 


Phe 


He 


Arg 


Asn 


Phe 


535 










540 


Gly Glu 


Lys 


Gly 


Val 


Leu 


Leu 


550 










555 


Ala Ile


Ala 


Arg 


Ala 


Leu 


Leu 


565 










570 


Asp Glu 


Ala 


Thr 


Ser 


Ala 


Leu 


580 










585 


Gln Glu


Ala 


Leu 


Asp 


Arg 


Leu 


595 










600 


Ile Ala


His 


Arg 


Leu 


Ser 


Thr 


610 










615 


Val Leu 


Asp 


Gin 


Gly 


Lys 


He 


625 










630 


Leu Leu 


Ser 


Lys 


Pro 


Asn 


Gly 


640 










645 


Gln Ser


Phe 


He 


Ser 


Ala 




655 












Arg Gln


Gln


Ile


Gln


Ala 


Leu 


10 










15 


Trp Arg 


He 


Lys 


Arg 


Glu 


Phe 


25 










30 


Ser Glu 


His 


Phe 


Arg 


Ala 


Thr 


40 










45 


Val Leu 


Gly 


Ser 


Val 


Asp 


Gin 


55 










60 


Ala Tyr 


Thr 


Pro 


Val 


Ser 


Asn 


70 










75 


Met Ala 


Leu 


Ala 


Ser 


Phe 


Met 


85 










90 


Pro Asp 


Glu 


Glu 


Thr 


Met 


Asp 


100 










105 


Glu Met 


Val 


Gly 


Val 


He 


Phe 


115 










120 


Lys Phe 


Asn 


Trp 


Gly 


Tyr 


Arg 


130 










135 


Glu Tyr 


Thr 


Glu 


His 


Cys 


Trp 


145 










150 


Tyr Leu 


Ala 


Lys 


Tyr 


Trp 


Leu 


160 










165 


Ala Ile


Asn 


Ala 


Ala 


He 


He 


175 










180 


Met Glu 


Glu 


Leu 


Thr 


Ser 


Val 


190 










195 


Pro Phe 


He 


Ser 


Lys 


Gly 


Glu 


205 










210 


Thr Cys 


Leu 


Val 


Ser 


Phe 


Ser 
215










220 










225 


Ser 


Phe 


He 


Tyr 


Phe 


Ala 


Ser 


Leu 


Asn Val 


Ala 


Arg 


Glu 


Arg 


Gly 








230 










235 










240 


Lys 


Phe 


Lys 


Lys 


Leu 


Met 


Thr 


Val 


Met Gly 


Leu 


Arg 


Glu 


Ser 


Ala 




245 










250 










255 


Phe 


Trp 


Leu 


Ser 


Trp 


Gly 


Leu 


Thr 


Tyr Ile


Cys 


Phe 


He 


Phe 


He 








260 










265 










270 


Met 


Ser 


He 


Phe 


Met 
275 


Ala 


Leu 


Val 


He 


Thr 
280 


Ser 


He 


Pro 


He 


Val 
285 


Phe 


His 


Thr 


Gly 


Phe 
290 


Met 


Val 


He 


Phe 


Thr 
295 


Leu 


Tyr 


Ser 


Leu 


Tyr 
300 


Gly 


Leu 


Ser 


Leu 


He 


Ala 


Leu 


Ala 


Phe 


Leu 


Met 


Ser 


Val 


Leu 


He 








305 










310 










315 


Arg 


Lys 


Pro 


Met 


Leu 


Ala 


Gly 


Leu 


Ala Gly 


Phe 


Leu 


Phe 


Thr 


Val 






320 










325 










330 


Phe 


Trp 


Gly 


Cys 


Leu 
335 


Gly 


Phe 


Thr 


Val 


Leu 
340 


Tyr 


Arg 


Gin 


Leu 


Pro 
345 


Leu 


Ser 


Leu 


Gly 


Trp 


Val 


Leu 


Ser 


Leu 


Leu 


Ser 


Pro 


Phe 


Ala 


Phe 








350 










355 










360 


Thr 


Ala 


Gly 


Met 


Ala 
365 


Gin 


He 


Thr 


His 


Leu 
370 


Asp 


Asn 


Tyr 


Leu 


Ser 
375 


Gly 


Val 


He 


Phe 


Pro 


Asp 


Pro 


Ser 


Gly Asp 


Ser 


Tyr 


Lys 


Met 


He 








380 










385 










390 


Ala 


Thr 


Phe 


Phe 


He 


Leu 


Ala 


Phe 


Asp Thr 


Leu 


Phe 


Tyr 


Leu 


He 










395 










400 










405 


Phe 


Thr 


Leu 


Tyr 


Phe 


Glu 


Arg 


Val 


Leu 


Pro 


Asp 


Lys 


Asp 


Gly 


His 








410 










415 










420 


Gly 


Asp 


Ser 


Pro 


Leu 


Phe 


Phe 


Leu 


Lys 


Ser 


Ser 


Phe 


Trp 


Ser 


Lys 






425 










430 










435 


His 


Gin 


Asn 


Thr 


His 
440 


His 


Glu 


He 


Phe 


Glu 
445 


Asn 


Glu 


He 


Asn 


Pro 
450 


Glu 


His 


Ser 


Ser 


Asp 
455 


Asp 


Ser 


Phe 


Glu 


Pro 
460 


Val 


Ser 


Pro 


Glu 


Phe 
465 


His 


Gly 


Lys 


Glu 


Ala 
470 


He 


Arg 


He 


Arg 


Asn 
475 


Val 


He 


Lys 


Glu 


Tyr 
480 


Asn 


Gly 


Lys 


Thr 


Gly 
485 


Lys 


Val 


Glu 


Ala 


Leu 
490 


Gin 


Gly 


He 


Phe 


Phe 
495 


Asp 


He 


Tyr 


Glu 


Gly 


Gin 


He 


Thr 


Ala 


He 


Leu 


Gly 


His 


Asn 


Gly 








500 










505 










510 


Ala 


Gly 


Lys 


Ser 


Thr 


Leu 


Leu 


Asn 


He 


Leu 


Ser 


Gly 


Leu 


Ser 


Val 








515 










520 










525 


Ser 


Thr 


Glu 


Gly 


Ser 


Ala 


Thr 


He 


Tyr Asn 


Thr 


Gin 


Leu 


Ser 


Glu 










530 










535 










540 


Ile


Thr 


Asp 


Met 


Glu 
545 


Glu 


He 


Arg 


Lys 


Asn 
550 


He 


Gly 


Phe 


Cys 


Pro 
555 


Gin 


Phe 


Asn 


Phe 


Gin 
560 


Phe 


Asp 


Phe 


Leu 


Thr 
565 


Val 


Arg 


Glu 


Asn 


Leu 
570 


Arg 


Val 


Phe 


Ala 


Lys 


He 


Lys 


Gly 


He 


Gin 


Pro 


Lys 


Glu 


Val 


Glu 








575 










580 










585 


Gin 


Glu 


Val 


Leu 


Leu 
590 


Leu 


Asp 


Glu 


Pro 


Thr 
595 


Ala 


Gly 


Leu 


Asp 


Pro 
600 


Phe 


Ser 


Arg 


His 


Arg 
605 


Val 


Trp 


Ser 


Leu 


Leu 
610 


Lys 


Glu 


His 


Lys 


Val 
615 


Asp 


Arg 


Leu 


Ile


Leu 


Phe 


Ser 


Thr 


Gin 


Phe 


Met 


Asp 


Glu 


Ala 


Asp 






620 










625 










630 


He 


Leu 


Ala 


Asp 


Arg 
635 


Lys 


Val 


Phe 


Leu 


Ser 
640 


Asn 


Gly 


Lys 


Leu 


Lys 
645 


Cys 


Ala 


Gly 


Ser 


Ser 


Leu 


Phe 


Leu 


Lys Arg 


Lys 


Trp 


Gly 


He 


Gly 








650 










655 










660 


Tyr 


His 


Leu 


Ser 


Leu 


His 


Arg 


Asn 


Glu Met 


Cys 


Asp 


Thr 


Glu 


Lys 








665 










670 










675 


He 


Thr 


Ser 


Leu 


He 
680 


Lys 


Gin 


His 


He 


Pro 
685 


Asp 


Ala 


Lys 


Leu 


Thr 
690 
Thr


Glu 


Ser 


Glu Glu 








695 


Thr 


Asn 


Lys 


Phe Pro 








710 


Asp 


Gin 


Gly 


He Arg 








725 


Glu 


Val 


Phe 


Leu Asn 








740 


Asp 


Phe 


Asp 


He Gly 








755 


Thr 


Gly 


Asp 


Glu Ser 








770 


Glu 


Thr 


Arg 


Lys Ala 








785 


Ile


Tyr 


Ala 


Val Ala 








800 


Arg 


Arg 


Ala 


Leu Leu 








815 


He 


Pro 


He 


He Leu 








830 


Thr 


His 


Cys 


Trp Glu 








845 


Glu 


Gin 


He 


Pro Lys 








860 


Asn 


Thr 


Gly 


Ser Asn 








875 


Gin 


Asp 


He 


Val Leu 








890 


Ser 


Asp 


Asp 


Pro Ser 








905 


Gin 


Lys 


Asp 


Tyr Arg 








920 


Asn 


Cys 


Phe 


Pro Val 








935 


Gly 


He 


Phe 


Asn Phe 








950 


Phe 


Phe 


Arg 


Asp Asp 








965 


Ser 


He 


Phe 


Leu Leu 








980 


Gly 


He 


Ser 


Ser He 








995 


Ser 


He 


Leu 


Cys Gin 








1010 


Gly 


Leu 


Trp 


Pro Ser 








1025 


He 


Pro 


Leu 


His Phe 








1040 


Tyr 


Phe 


Ser 


Phe Leu 








1055 


Val 


Leu 


Val 


Val Cys 








1070 


Leu 


Met 


Tyr 


Val Leu 








1085 


Asn 


Gly 


Phe 


Trp Ser 








1100 


Ser 


Thr 


He 


Leu Val 








1115 


Leu 


Cys 


Met 


He Phe 








1130 


Leu 


Leu 


He 


Gin Leu 








1145 


Leu 


Asp 


Asn 


Arg He 



Lys 


Leu 


Val 


Tyr Ser 








700 


Asp 


Leu 


Tyr 


Ser Asp 








715 


Asn 


Tyr 


Ala 


Val Ser 








730 


Leu 


Glu 


Gly 


Lys Ser 








745 


Lys 


Gin 


Glu 


Lys He 








760 


Glu 


Met 


Glu 


Gin Val 








775 


Val 


Ser 


Ser 


Ala Ala 








790 


Thr 


Leu 


Arg 


Phe Leu 








805 


Cys 


Leu 


Leu 


Leu Val 








820 


Glu 


Lys 


He 


Met Tyr 








835 


Phe 


Ser 


Pro 


Ser Met 








850 


Thr 


Pro 


Leu 


Thr Ser 








865 


He 


Glu 


Asp 


Leu Val 








880 


Glu 


He 


Asp 


Asp Phe 








895 


Tyr 


Asn 


Gly 


Ala He 








910 


Phe 


Ser 


Val 


Ala Cys 








925 


Leu 


Met 


Gly 


He Val 








940 


Thr 


Glu 


Leu 


Ile Gln








955 


He 


Val 


Leu 


Asp Leu 








970 


Leu 


He 


Thr 


Asn Cys 








985 


Ser 


Asp 


Tyr 


Lys He 








1000 


Lys 


Asn 


Val 


Gin Ser 








1015 


Ala 


Tyr 


Trp 


Cys Gly 








1030 


Leu 


He 


Leu 


Leu Ser 








1045 


Gly 


Phe 


Gin 


Leu Pro 








1060 


He 


He 


Gly 


Cys Ala 








1075 


Ser 


Phe 


He 


Phe Cys 








1090 


Phe 


Gly 


Phe 


Phe He 








1105 


Ser 


Thr 


Lys 


Tyr Glu 








1120 


He 


Pro 


Ser 


Phe Thr 








1135 


Asn 


Phe 


Met 


Tyr Met 








1150 


Asn 


Glu 


Val 


Asn Lys 



Leu 


Pro 


Leu 


Glu Lys 








705 


Leu 


Asp 


Lys 


Cys Ser 








720 


Val 


Thr 


Ser 


Leu Asn 








735 


Ala 


He 


Asp 


Glu Pro 








750 


His 


Val 


Thr 


Arg Asn 








765 


Leu 


Cys 


Ser 


Leu Pro 








780 


Leu 


Trp 


Arg 


Arg Gin 








795 


Lys 


Leu 


Arg 


Arg Glu 








810 


Leu 


Gly 


He 


Ala Phe 








825 


Lys 


Val 


Thr 


Arg Glu 








840 


Tyr 


Phe 


Leu 


Ser Leu 








855 


Leu 


Leu 


He 


Val Asn 








870 


His 


Ser 


Leu 


Lys Cys 








885 


Arg 


Asn 


Arg 


Asn Gly 








900 


He 


Val 


Ser 


Gly Asp 








915 


Asn 


Thr 


Lys 


Lys Ser 








930 


Ser 


Asn 


Ala 


Leu He 








945 


Met 


Glu 


Ser 


Thr Ser 








960 


Gly 


Phe 


He 


Asp Gly 








975 


He 


Ser 


Pro 


Tyr He 








990 


Pro 


Ser 


Ser 


He Pro 








1005 


Gin 


Leu 


Trp 


He Ser 








1020 


Gin 


Ala 


Leu 


Val Asp 








1035 


He 


His 


Leu 


He Tyr 








1050 


Trp 


Glu 


Leu 


Met Phe 








1065 


Ala 


Ser 


Leu 


He Phe 








1080 


Lys 


Trp 


Arg 


Lys Asn 








1095 


Val 


Leu 


Ile


Cys Val 








1110 


Lys 


Pro 


Asn 


Leu He 








1125 


Phe 


Leu 


Asp 


Met Ser 








1140 


Arg 


Asn 


Leu 


Asp Ser 








1155 


Thr 


He 


Leu 


Leu Thr 
1160






Asn 


Leu 


Ile


Pro Tyr 
1175 


Leu 


Gin 


Arg 


Cys 


Leu 


Glu Met 
1190 


Lys 


Tyr 


Pro 


Val 


Phe 


Arg He 
1205 


Ser 


Pro 


Pro 


Glu 


Glu 


Pro Glu 
1220 


Glu 


Asp 


Ala 


Asn 


Ala 


Leu Thr 
1235 


Thr 


Pro 


Thr 


Ala 


Ser 


Cys Leu 
1250 


His 


Lys 


Cys 


Phe 


Ser 


Thr Thr 
1265 


Lys 


Lys 


Phe 


Cys 


Val 


Lys Lys 
1280 


Gly 


Glu 


Gly 


Ala 


Gly 


Lys Ser 
1295 


Thr 


Ser 


Val 


Pro 


Thr 


Ala Gly 
1310 


Val 


Val 


Val 


Arg 


Gin 


Gin Arg 
1325 


Asp 


Asn 


Pro 


Gin 


Glu 


Asn Ser 
1340 


Leu 


Trp 


Leu 


Glu 


Leu 


Tyr Ala 
1355 


Ala 


Val 


Ala 


Leu 


Ser 


He Ser 
1370 


Arg 


Leu 


Gin 


Leu 


Lys 


Ala Pro 
1385 


Val 


Lys 


Lys 


Leu 


Cys 


Phe Val 
1400 


Leu 


Ser 


Leu 


Leu 


Asp 


Glu Pro 
1415 


Phe 


Thr 


Gin 


Met 


Trp 


Gin He 
1430 


Leu 


Gin 


Gly 


Thr 


Leu 


Leu Thr 
1445 


Thr 


His 


Cys 


Asp 


Arg 


Val Ala 
1460 


He 


Met 


Gly 


Ser 


Ile


Gln His
1475 


Leu 


Lys 


Leu 


Glu 


He 


Lys Met 
1490 


Lys 


Glu 


Thr 


Glu 


Ile


Leu Lys 
1505 


Leu 


Phe 


Tyr 


Ser 


Ser 


Leu Met 
1520 


Ala 


Tyr 


Pro 


Leu 


Ser 


Arg Ala 
1535 


Phe 


Phe 


Phe 


Asn 


Leu 


Glu Glu 
1550 


Tyr 


Ser 


Val 


Phe 


Leu 


Glu Leu 
1565 


Cys 


Lys 


Asp 


Lys 


He 


Asp Thr 
1580 


Thr 


Val 


Asp 


Pro 











<210> 6 
<211> 382 
<212> PRT 





1165 




1170 


Ser 


Val He 


Phe Leu 


Phe Val He 




1180 




1185 


Gly 


Asn Glu 


He Met 


Asn Lys Asp 




1195 




1200 


Arg 


Ser Arg 


Gly Thr 


His Thr Asn 




1210 




1215 


Val 


Gin Ala 


Glu Arg 


Val Gin Ala 




1225 




1230 


Asn 


Leu Glu 


Glu Glu 


Pro Val He 




1240 




1245 


Glu 


Tyr Tyr 


Glu Thr 


Lys Lys Ser 




1255 




1260 


Lys 


Ala Ala 


He Arg 


Asn Val Ser 




1270 




1275 


Val 


Leu Gly 


Leu Leu 


Gly His Asn 




1285 




1290 


He 


Lys Met 


He Thr 


Gly Cys Thr 




1300 




1305 


Val 


Leu Gin 


Gly Asn 


Arg Ala Ser 




1315 




1320 


Ser 


Leu Lys 


Phe Leu 


Gly Tyr Cys 




1330 




1335 


Pro 


Lys Leu 


Thr Met 


Lys Glu His 




1345 




1350 


Lys 


Gly Leu 


Gly Lys 


Glu Asp Ala 




1360 




1365 


Val 


Glu Ala 


Leu Lys 


Leu Gin Glu 




1375 




1380 


Thr 


Leu Ser 


Glu Gly 


He Lys Arg 




1390 




1395 


He 


Leu Gly 


Asn Pro 


Ser Val Val 




1405 




1410 


Gly 


Met Asp 


Pro Glu 


Gly Gln Gln




1420 




1425 


Ala 


Thr He 


Lys Asn 


Gln Glu Arg




1435 




1440 


Tyr 


Met Ser 


Glu Ala 


Lys Ser Leu 




1450 




1455 


Val 


Ser Gly 


Thr Leu 


Arg Cys He 




1465 




1470 


Asn 


Lys Phe 


Gly Lys 


Asp Tyr Leu 




1480 




1485 


Pro 


Thr Gin 


Val Glu 


Ala Leu His 




1495 




1500 


Pro 


Gin Ala 


Ala Trp 


Gin Glu Arg 




1510 




1515 


Lys 


Leu Pro 


Val Glu 


Asp Val His 




1525 




1530 


Lys 


Leu Glu 


Ala Met 


Lys Gin Thr 




1540 




1545 


Leu 


Ser Gin 


Ala Thr 


Leu Glu Gin 




1555 




1560 


Glu 


Gin Glu 


Leu Gly 


Asn Val Asp 




1570 




1575 


Glu 


Trp Lys 


Leu Leu 


Pro Gin Glu 




1585 




1590 
<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No: 6754139CD1 

<400> 6 

Met Asp Glu Arg Asn Gln Val Leu Thr Leu Tyr Leu Trp Ile Arg
1 5 10 15

Gln Glu Trp Thr Asp Ala Tyr Leu Arg Trp Asp Pro Asn Ala Tyr
20 25 30

Gly Gly Leu Asp Ala Ile Arg Ile Pro Ser Ser Leu Val Trp Arg
35 40 45

Pro Asp Ile Val Leu Tyr Asn Lys Ala Asp Ala Gln Pro Pro Gly
50 55 60

Ser Ala Ser Thr Asn Val Val Leu Arg His Asp Gly Ala Val Arg
65 70 75

Trp Asp Ala Pro Ala Ile Thr Arg Ser Ser Cys Arg Val Asp Val
80 85 90

Ala Ala Phe Pro Phe Asp Ala Gln His Cys Gly Leu Thr Phe Gly
95 100 105

Ser Trp Thr His Gly Gly His Gln Val Asp Val Arg Pro Arg Gly
110 115 120

Ala Ala Ala Ser Leu Ala Asp Phe Val Glu Asn Val Glu Trp Arg
125 130 135

Val Leu Gly Met Pro Ala Arg Arg Arg Val Leu Thr Tyr Gly Cys
140 145 150

Cys Ser Glu Pro Tyr Pro Asp Val Thr Phe Thr Leu Leu Leu Arg
155 160 165

Arg Arg Ala Ala Ala Tyr Val Cys Asn Leu Leu Leu Pro Cys Val
170 175 180

Leu Ile Ser Leu Leu Ala Pro Leu Ala Phe His Leu Pro Ala Asp
185 190 195

Ser Gly Glu Lys Val Ser Leu Gly Val Thr Val Leu Leu Ala Leu
200 205 210

Thr Val Phe Gln Leu Leu Leu Ala Glu Ser Met Pro Pro Ala Glu
215 220 225

Ser Val Pro Leu Ile Gly Lys Tyr Tyr Met Ala Thr Met Thr Met
230 235 240

Val Thr Phe Ser Thr Ala Leu Thr Ile Leu Ile Met Asn Leu His
245 250 255

Tyr Cys Gly Pro Ser Val Arg Pro Val Pro Ala Trp Ala Arg Ala
260 265 270

Leu Leu Leu Gly His Leu Ala Arg Gly Leu Cys Val Arg Glu Arg
275 280 285

Gly Glu Pro Cys Gly Gln Ser Arg Pro Pro Glu Leu Ser Pro Ser
290 295 300

Pro Gln Ser Pro Glu Gly Gly Ala Gly Pro Pro Ala Gly Pro Cys
305 310 315

His Glu Pro Arg Cys Leu Cys Arg Gln Glu Ala Leu Leu His His
320 325 330

Val Ala Thr Ile Ala Asn Thr Phe Arg Ser His Arg Ala Ala Gln
335 340 345

Arg Cys His Glu Asp Trp Lys Arg Leu Ala Arg Val Met Asp Arg
350 355 360

Phe Phe Leu Ala Ile Phe Phe Ser Met Ala Leu Val Met Ser Leu
365 370 375

Leu Val Leu Val Gln Ala Leu
380

<210> 7 
<211> 1115 
<212> PRT 
<213> Homo sapiens



<220> 

<221> misc_feature

<223> Incyte ID No: 6996659CD1 



<400> 7 



Met 


Arg 


Arg


Leu 


Ser 


Leu 


Trp 


Trp Leu Leu Ser Arg Val Cys Leu 


1 








5 






10 15 


Leu 


Leu 


Pro 


Pro 


Pro 


Cys


Ala 


Leu Val Leu Ala Gly Val Pro Ser 










20 






25 30


Ser 


Ser 


Ser 


His 


Pro 


Gln


Pro


Cys Gln Ile Leu Lys Arg Ile Gly










35 






40 45 


His 


Ala 


Val 


Arg 


Val 


Gly


Ala


Val His Leu Gln Pro Trp Thr Thr










50 






55 60 


Ala 


Pro 


Arg


Ala 


Ala 


Ser 


Arg 


Ala Pro Asp Asp Ser Arg Ala Gly 










65 






70 75 


Ala 


Gln


Arg 


Asp 


Glu 


Pro 


Glu 


Pro Gly Thr Arg Arg Ser Pro Ala 










80 






85 90 


Pro 


Ser 


Pro 


Gly


Ala 


Arg 


Trp 


Leu Gly Ser Thr Leu His Gly Arg 








95 






100 105 


Gly


Pro


Pro


Gly


Ser 


Arg 


Lys 


Pro Gly Glu Gly Ala Arg Ala Glu 










110 






115 120 


Ala 


Leu 


Trp


Pro 


Arg 


Asp 


Ala 


Leu Leu Phe Ala Val Asp Asn Leu










125 






130 135 


Asn 




Val 


Glu 


Gly


Leu 


Leu 


Pro Tyr Asn Leu Ser Leu Glu Val 








140 






145 150 


Val 


Met 


Ala 


Ile


Glu


Ala


Gly


Leu Gly Asp Leu Pro Leu Leu Pro 










155 






160 165 


Phe 


Ser 


Ser 


Pro 


Ser 


Ser 


Pro 


Trp Ser Ser Asp Pro Phe Ser Phe 










170 






175 180 


Leu


oil! 


OCl 


Val 


V— _y o 


His 


Thr 


Val Val Val Gln Gly Val Ser Ala










185






190 195 


Leu


Leu


Ala 


Phe 


Pro 


Gln


Ser


Gln Gly Glu Met Met Glu Leu Asp










200 






205 210 


Leu


Val




Leu


Val 


Leu 


His 


Ile Pro Val Ile Ser Ile Val Arg










215 






220 225 


His 


Glu 


Phe 


Pro 


Arg 


Glu 


Ser 


Gin Asn Pro Leu His Leu Gin Leu 










230 






235 240 


Ser 


Leu 


Glu 


Asn 


Ser 


Leu 


Ser 


Ser Asp Ala Asp Val Thr Val Ser 










245 






250 255 


Ile


Leu 


Thr 


Met 


Asn 


Asn 


Trp 


Tyr Asn Phe Ser Leu Leu Leu Cys 










260 






265 270 


Gln


Glu 


Asp 


Trp 


Asn 


Ile


Thr


Asp Phe Leu Leu Leu Thr Gln Asn










275 






280 285 


Asn 


Ser 


Lys 


Phe 


His 


Leu 


Gly 


Ser Ile Ile Asn Ile Thr Ala Asn








290 






295 300 


Leu


Pro 


Ser 


Thr 


Gln


Asp


Leu


Leu Ser Phe Leu Gln Ile Gln Leu










305 




310 315 


Glu 


Ser 


Ile


Lys 


Asn 


Ser 


Thr 


Pro Thr Val Val Met Phe Gly Cys 








320 






325 330 


Asp 


Met 


Glu 


Ser 


Ile


Arg


Arg


Ile Phe Glu Ile Thr Thr Gln Phe








335 






340 345 


Gly 


Val 


Met 


Pro 


Pro 


Glu 


Leu 


Arg Trp Val Leu Gly Asp Ser Gln








350 






355 360 


Asn 


Val 


Glu 


Glu 


Leu 


Arg 


Thr 


Glu Gly Leu Pro Leu Gly Leu Ile










365 






370 375 


Ala 


His 


Gly 


Lys 


Thr 


Thr 


Gln


Ser Val Phe Glu His Tyr Val Gln










380 






385 390 


Asp 


Ala 


Met 


Glu 


Leu 


Val 


Ala 


Arg Ala Val Ala Thr Ala Thr Met 








395 






400 405 


Ile


Gln


Pro 


Glu 


Leu 


Ala 


Leu 


Ile Pro Ser Thr Met Asn Cys Met










410 






415 420 
Glu


Val 


Glu 


Thr 


Thr 
425 


Asn 


Leu 


Thr 


Ser 


Gly 
430 


Gin 


Tyr 


Leu 


Ser 


Arg 
435 


Phe 


Leu 


Ala 


Asn 


Thr 
440 


Thr 


Phe 


Arg 


Gly 


Leu 
445 


Ser 


Gly 


Ser 


He 


Arg 
450 


Val 


Lys 


Gly 


Ser 


Thr 
455 


He 


Val 


Ser 


Ser 


Glu 
460 


Asn 


Asn 


Phe 


Phe 


He 
465 


Trp 


Asn 


Leu 


Gin 


His 
470 


Asp 


Pro 


Met 


Gly 


Lys 
475 


Pro 


Met 


Trp 


Thr 


Arg 
480 


Leu 


Gly 


Ser 


Trp 


Gin 
485 


Gly 


Gly 


Lys 


He 


Val 
490 


Met 


Asp 


Tyr 


Gly 


He 
495 


Trp 


Pro 


Glu 


Gin 


Ala 
500 


Gin 


Arg 


His 


Lys 


Thr 
505 


His 


Phe 


Gin 


His 


Pro 
510 


Ser 


Lys 


Leu 


His 


Leu 
515 


Arg 


Val 


Val 


Thr 


Leu 
520 


Ile


Glu 


His 


Pro 


Phe 
525 


Val 


Phe 


Thr 


Arg 


Glu 
530 


Val 


Asp 


Asp 


Glu 


Gly 
535 


Leu 


Cys 


Pro 


Ala 


Gly 
540 


Gin 


Leu 


Cys 


Leu 


Asp 
545 


Pro 


Met 


Thr 


Asn 


Asp 
550 


Ser 


Ser 


Thr 


Leu 


Asp 
555 


Ser 


Leu 


Phe 


Ser 


Ser 
560 


Leu 


His 


Ser 


Ser 


Asn 
565 


Asp 


Thr 


Val 


Pro 


He 
570 


Lys 


Phe 


Lys 


Lys 


Cys 
575 


Cys 


Tyr 


Gly 


Tyr 


Cys 
580 


Ile


Asp 


Leu 


Leu 


Glu 
585 


Lys 


He 


Ala 


Glu 


Asp 
590 


Met 


Asn 


Phe 


Asp 


Phe 
595 


Asp 


Leu 


Tyr 


He 


Val 
600 


Gly 


Asp 


Gly 


Lys 


Tyr 
605 


Gly 


Ala 


Trp 


Lys 


Asn 
610 


Gly 


His 


Trp 


Thr 


Gly 
615 


Leu 


Val 


Gly 


Asp 


Leu 
620 


Leu 


Arg 


Gly 


Thr 


Ala 
625 


His 


Met 


Ala 


Val 


Thr 
630 


Ser 


Phe 


Ser 


He 


Asn 
635 


Thr 


Ala 


Arg 


Ser 


Gin 
640 


Val 


He 


Asp 


Phe 


Thr 

645 


Ser 


Pro 


Phe 


Phe 


Ser 
650 


Thr 


Ser 


Leu 


Gly 


He 
655 


Leu 


Val 


Arg 


Thr 


Arg 
660 


Asp 


Thr 


Ala 


Ala 


Pro 
665 


He 


Gly 


Ala 


Phe 


Met 
670 


Trp 


Pro 


Leu 


His 


Trp 
675 


Thr 


Met 


Trp 


Leu 


Gly 
680 


Ile


Phe 


Val 


Ala 


Leu 
685 


His 


He 


Thr 


Ala 


Val 
690 


Phe 


Leu 


Thr 


Leu 


Tyr 
695 


Glu 


Trp 


Lys 


Ser 


Pro 
700 


Phe 


Gly 


Leu 


Thr 


Ser 
705 


Lys 


Gly 


Arg 


Asn 


Arg 
710 


Ser 


Lys 


Val 


Phe 


Ser 
715 


Phe 


Ser 


Ser 


Ala 


Leu 
720 


Asn 


He 


Cys 


Tyr 


Ala 
725 


Leu 


Leu 


Phe 


Gly 


Arg 
730 


Thr 


Val 


Ala 


He 


Lys 
735 


Pro 


Pro 


Lys 


Cys 


Trp 
740 


Thr 


Gly 


Arg 


Phe 


Leu 
745 


Met 


Asn 


Leu 


Trp 


Ala 
750 


Ile


Phe 


Cys 


Met 


Phe 
755 


Cys 


Leu 


Ser 


Thr 


Tyr 
760 


Thr 


Ala 


Asn 


Leu 


Ala 
765 


Ala 


Val 


Met 


Val 


Gly 
770 


Glu 


Lys 


He 


Tyr 


Glu 
775 


Glu 


Leu 


Ser 


Gly 


He 
780 


His 


Asp 


Pro 


Lys 


Leu 
785 


His 


His 


Pro 


Ser 


Gin 
790 


Gly 


Phe 


Arg 


Phe 


Gly 
795 


Thr 


Val 


Arg 


Glu 


Ser 
800 


Ser 


Ala 


Glu 


Asp 


Tyr 
805 


Val 


Arg 


Gin 


Ser 


Phe 
810 


Pro 


Glu 


Met 


His 


Glu 
815 


Tyr 


Met 


Arg 


Arg 


Tyr 
820 


Asn 


Val 


Pro 


Ala 


Thr 
825 


Pro 


Asp 


Gly 


Val 


Glu 
830 


Tyr 


Leu 


Lys 


Asn 


Asp 
835 


Pro 


Glu 


Lys 


Leu 


Asp 
840 


Ala 


Phe 


Ile


Met 


Asp 
845 


Lys 


Ala 


Leu 


Leu 


Asp 
850 


Tyr 


Glu 


Val 


Ser 


He 
855 


Asp 


Ala 


Asp 


Cys 


Lys 
860 


Leu 


Leu 


Thr 


Val 


Gly 
865 


Lys 


Pro 


Phe 


Ala 


He 
870 


Glu 


Gly 


Tyr 


Gly 


He 
875 


Gly 


Leu 


Pro 


Pro 


Asn 
880 


Ser 


Pro 


Leu 


Thr 


Ala 
885 


Asn 


He 


Ser 


Glu 


Leu 


He 


Ser 


Gin 


Tyr 


Lys 


Ser 


His 


Gly 


Phe 


Met 
890








Asp 


Met 


Leu 


His Asp 
905 


Lys 


Trp 


Tyr 


Arg 


Ser 


Phe 


Ala Val 
920 


Thr 


Glu 


Thr 


Phe 


Ser 


Gly 


Leu Phe 
935 


Val 


Leu 


Leu 


Ile


Leu 


Thr 


Thr He 
950 


Gly 


Glu 


His 


Pro 


Arg 


He 


Lys Asn 
965 


Lys 


Ser 


Lys 


Ser 


Gin 


Arg 


Leu His 
980 


Arg 


Ala 


He 


Lys 


Gin 


Gin 


His Phe 
995 


Lys 


Thr 


Lys 


Val 


Gly 


Pro 


Arg Gin 
1010 


Leu 


Thr 


Val 


His 


Asp 


Asn 


Arg Arg 
1025 


Lys 


Tyr 


He 


Asn 


Gin 


Leu 


Gly He 
1040 


Arg 


He 


His 


Arg 


Arg 


Arg 


Glu Leu 
1055 


Pro 


Ala 


Leu 


Asp 


Ser 


Leu 


Asn Val 
1070 


Ser 


Arg 


Asn 


Glu 


Leu 


Glu 


Lys Gin 
1085 


He 


Gin 


Val 


Ala 


Val 


Ser 


Arg Lys 
1100 


Thr 


Glu 


Leu 


Arg 


Thr 


Cys 


Glu Ser 
1115 









<210> 8 
<211> 295 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7472747CD1 



<400> 8 














Met 


Pro 


Ser 


Ala 


Gly 


Leu 


Cys 


Ser 


1 








5 








Pro 


Leu 


Leu 


Leu 


Ala 
20 


Tyr 


Val 


Cys 


He 


Phe 


Gin 


Leu 


Leu 
35 


Glu 


Arg 


Gin 


Gin 


Phe 


Gin 


Leu 


Glu 
50 


Lys 


Leu 


Arg 


Leu 


Asp 


Gin 


Trp 


Ala 
65 


Met 


Glu 


Gin 


Ala 


Trp 


Val 


Lys 


Gly 
80 


Val 


Asn 


Pro 


Ser 


Asn 


Trp 


Asp 


Phe 
95 


Gly 


Ser 


Ser 


Val 


Thr 


Thr 


Ile


Gly 
110 


Tyr 


Gly 


Asn 


Gly 


Gln


Val 


Phe 


Cys 
125 


Val 


Phe 


Tyr 


Asn 


Val 


Ile


Phe 


Leu 
140 


Asn 


His 


Leu 


Leu 


Ala 


Ala 


Ile


Glu 


Arg 


Trp 


Glu 



895 






900 


Arg Val 


Val Pro 


Cys 


Gly Lys 


910 






915 


Leu Gln


Met Gly 


Ile


Lys His 


925 






930 


Cys Ile


Gly Phe 


Gly 


Leu Ser 


940 






945 


Ile Val


Tyr Arg 


Leu 


Leu Leu 


955 






960 


Leu Gln


Tyr Trp 


Leu 


His Thr 


970 






975 


Asn Thr 


Ser Phe 


Ile


Glu Glu 


985 






990 


Arg Val 


Glu Lys 


Arg 


Ser Asn 


1000 






1005 


Trp Asn 


Thr Ser 


Asn 


Leu Ser 


1015 






1020 


Phe Ser 


Asp Glu 


Glu 


Gly Gin 


1030 






1035 


Gln Asp


Ile Pro


Leu 


Pro Pro 


1045 






1050 


Arg Thr 


Thr Asn 


Gly 


Lys Ala 


1060 






1065 


Ser Val 


Met Gln


Glu 


Leu Ser 


1075 






1080 


Ile Arg


Gln Glu


Leu 


Gln Leu


1090 






1095 


Glu Glu 


Tyr Gln


Arg 


Thr Ser 


1105 






1110 



Cys 


Trp 


Gly Gly 


Arg 


Val 


Leu 




10 








15 


Tyr 


Leu 


Leu Leu 


Gly 


Ala 


Thr 




25 








30 


Ala 


Glu 


Ala Gln


Ser 


Arg 


Asp 




40 








45 


Phe 


Leu 


Glu Asn 


Tyr 


Thr 


Cys 




55 








60 


Phe 


Val 


Gln Val


Ile


Met 


Glu 




70 








75 


Lys 


Gly 


Asn Ser 


Thr 


Asn 


Pro 




85 








90 


Phe 


Phe 


Phe Ala 


Gly 


Thr 


Val 




100 








105 


Leu 


Ala 


Pro Ser 


Thr 


Glu 


Ala 




115 








120 


Ala 


Leu 


Leu Gly 


Ile


Pro 


Leu 




130 








135 


Gly 


Thr 


Gly Leu 


Arg 


Ala 


His 




145 








150 


Asp 


Arg 


Pro Arg 


Arg 


Ser 


Gln
155



Glu 


Val 


Leu 


Gln Val
170 


Leu 


Gly 


Leu Ala 


Thr 


Leu 


Val 


Ile Leu
185 


Ile


Phe 


Pro Pro 


Glu 


Gly


Trp


Ser Phe 
200 


Ser 


Glu 


Gly Phe 


Leu 


Ser 


Thr 


Ile Gly
215 


Phe 


Gly 


Asp Tyr 


Ser 


Lys 


His 


Tyr Ile
230 


Ser 


Val 


Tyr Arg 


Ile


Leu 


Leu 


Gly Leu 
245 


Ala 


Trp 


Leu Ala 


Pro 


Leu 


Leu 


Leu His 
260 


Arg 


Cys 


Cys Gln


Gly 


Leu 


Gly 


Val Lys 
275 


Asp 


Gly 


Ala Ala 


Pro 


Arg 


Pro 


Gln Lys


Ile


Pro 


Ile Ser



290 

<210> 9 
<211> 384 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7474121CD1 

<400> 9 



Met 


Glu 


Val 


Ser 


Gly 


His 


Pro 


Gln Ala


1 








5 








Ala 


Leu 


Gly 


Lys 


Leu 
20 


Phe 


Pro 
Thr
Val
Leu
Glu
Cys


Arg 
Asp


Arg 


Lys 
Gln
Leu Gln


Lys 


Pro 


Gln


Trp 


Phe 
Arg


Thr Thr 
Leu


Phe 


Phe 


Cys 
Thr


Val Phe 


Tyr 


Ile


Tyr 


Pro 


Val 
125 


Thr 
Leu Gly


Tyr 


Ala 


Leu 


Phe 


Gly 
Ile
Leu Met


Thr 
Ile
155
Ile Leu


Phe 
Lys
Phe
Val
Tyr
Thr
Ala Gly Thr Asp
Trp
Leu
Leu
Arg
Ser


Asp Pro Ser Gly 
Arg Cys Cys Pro
Cys
Val
75
His Leu Gln Lys
85




90 
Ser
Thr Val Gly Tyr


Gly 
Lys


Tyr Leu Cys Met 


Leu 
Phe
Thr Ser Tyr Asn
Trp
Asp Pro Lys Pro


Ala 
Ser
Lys Gln Asn Thr
Ser


Asn Ser Cys Pro 
Val


Leu Gly 


Arg 
260 


Leu Ser Tyr Ser 
Val


Gly Gln


Gln
Val Glu Arg Leu
Ala


Leu Ile


Val 
Phe Ala Tyr Ile
Pro


Phe Trp 


Glu 
305 


Thr Gln Leu Asp


Phe 


Cys 


Phe Val 


Thr 


Leu Thr Thr Ile
His Pro


Asn 
Met Glu
Glu
65 


Phe 


Glu 
Lys
Arg
Val Asn
Val
Ile


Ile Ser Asn Leu


Asp 
Asp


Ile Pro Leu Pro


Ile


280 




285 
Cys Ala Ala Ala


Ile
300


Phe 


Glu Asn Ala Phe 


Tyr 


310 




315 


Gly 


Phe Gly Asp Thr 


Val 


325 




330 


Phe 


Ser Ile Tyr Ile


Ile


340 




345 


Phe 


Lys Leu Val Gln


Asn 


355 




360 


Met 


Leu Phe Phe Ala 


Lys 


370 




375 


Thr 


Glu Ile Arg Asn


Thr 
Gln Pro Cys Pro
Ala


Pro Asp Met Ser 
Ala Tyr Ser Leu
Ile
Gly Met Ala Phe
Leu Tyr Ser Ser
Gly Val His Gln


Met 
Ile


Leu Val Gly Asn 


Ile
Gln Val Phe Asn
Ala Met Glu Ala
Phe Val Ala Ile
Met


Thr Ala Ala Gly 
Gln


Ile


Leu 


Ile


Ser 
260 


Val 


Leu 


Pro 


Ser 


Tyr 


Thr 


Gly 
275 


Pro 


Gly 


Ile


Cys 


Lys 


Asn 


Leu 
Pro
Ala
Ile


Ser 


Gly 
305 


Ala 


Phe 


Ala 


Arg 


Tyr 


Met 
320


Lys 


Ile


Ile


Val 


Val 


Val 


Val 
335 


Ala 


Thr 


Pro 


Lys 


Lys 


Tyr 
350


Met 


Gln


Phe 


Pro 


Thr 


Pro 


Val 
365 


Ser 


Pro 


Ile


Gly 


Thr 


Ala 


Phe 
Leu


Leu 


Ala 


Met 


Gly 


Arg 
Leu
Asn


Gln


Glu 
410 


Met 


Ile


Gly 


Ser 


Phe 


Phe 


Lys 
425 
Leu
Val


Asp 
Ala
Ala


Leu 


Ile


Ala 
455 
Thr


Asp 


Pro 


Tyr 


Tyr 
470 
Trp


Ile


Trp 


Val 


Val 
485
Tyr
Val


Ala 


Val 
500 
Val
Gln


Thr 


Gln


Phe 
Arg


Asn 


Asp 


Thr 


Asp 


Ile


Tyr 
530 


Val 


Asn 


Asp 


Ile


Gln


Gly 


Ile
545 


Lys 


Ile


Phe 


Ala 
Ser


Glu 
560 


Ile


Phe 


Val 


Ser 


Leu 


Gln


Glu 
575 


Leu 


Gln


Thr 


Asp 


Pro 


Asn 


Asn 
590 
Gln


Ser 


Tyr 


Ile


Thr 


Phe 
605 


Ser 


Pro 


Glu 


Pro 


Pro 


Ala 


Ser 
620 


Ala 


Glu 


Leu 


Ala 


Ser 


Val 


Pro 
635 
Phe


Asp 


Met 
Gly


Val 
650 


Ser 


Phe 


Leu 


Ala 
Ser
Thr


Phe 
Val


Asn 


Ile
Ala
Val


Phe 


Glu 
Gly
Asp
710 


Ala 


Val 
Lys


Tyr Ile
265 


Phe 


Gly 


Leu 


Thr 


Ile
270 


Ser 


Ile Val
280 


Phe 


Thr 


Phe 


Ile


Asp 
285 


Thr 


Asn Ile
295 


Ala 


Ser 


Leu 


Ile


Phe 
300 


Leu 


Val Leu 
310 


Val 


Lys 


Glu 


Leu 


Asn 
315 


Arg 


Phe Pro 
325 


Ile


Pro 


Thr 


Glu 


Met 
330 


Ala 


Ile Ser
340 


Gly 


Gly 


Cys 


Lys 


Met 
345 


Ile


Val Gly 
355 


Glu 


Ile


Gln


Arg 


Gly 
360 


Val 


Val Ser 
370 


Gln


Trp 


Lys 


Asp 


Met 
Ala


Ile Val
385 


Ser 


Tyr 


Val 


Ile


Asn 
390 


Ala 


Asn Lys 
400 
Gly


Tyr 


Asp 


Val 
405 


Ala 


Leu Gly 
415 


Cys 


Ser 


Asn 


Phe 


Phe 
420 


Val 


Ile Cys
430 


Cys 


Ala 


Leu 


Ser 


Val 
435 


Gly 


Gly Lys 
445 


Ser 


Gln


Ser 


Val 


Leu 
450 


Leu 


Lys Asn 
460 


Ser 


Leu 


Lys 


Gln


Leu 
465 


Arg 


Lys Ser 
475 


Lys 


Leu 


Asp 


Cys 


Cys 
480 


Ser 


Ser Phe 
490 


Phe 


Leu 


Ser 


Leu 


Pro 
495 


Ala 


Phe Ser 
505 


Val 


Leu 


Val 


Val 


Val 
510 
Tyr Ala
520 


Leu 


Ala 


Gln


Val 


Met 
525 


Pro 


Lys Thr 
535 


Tyr 


Asn 


Arg 


Ala 


Gln
540


Ile


Thr Tyr 
Cys


Ser 


Pro 


Leu 


Tyr 
Arg


Gln Lys
Val


Ile


Ala 


Lys 


Thr 
570 


Gln


Asp Phe 
Glu


Asn 


Ala 


Pro 


Pro 
585 


Thr 


Pro Ala 
595 


Asn 


Gly 


Thr 


Ser 


Val 
600 


Asp 


Ser Ser 
610 


Ser 


Pro 


Ala 


Gln


Ser 
615 


Ala 


Pro Gly 
625 


Glu 


Pro 


Ser 


Asp 


Met 
Val


Thr Phe 
640 
Thr


Leu 


Ile


Leu 
645 


Val 


Asp Leu 
Met


Gly 


Ile


Lys


Ala 
660 


Tyr 


Gly Lys 
Ile
Val


Lys 


Val 
675 


Gln


Val Tyr
Asn


Asp 


Ile


Ser 


His 
690 


Ser 


Leu Glu 
700 


Cys 


Lys 


His 


Val 


Phe 
705 


Leu 


Phe Ala 
715 


Gln


Ala 


Asn 


Ala 


Arg 
720 
Asp Val Thr Pro Gly His Asn Phe Gln Gly Ala Pro Gly Asp Ala
725 730 735

Glu Leu Ser Leu Tyr Asp Ser Glu Glu Asp Ile Arg Ser Tyr Trp
740 745 750

Asp Leu Glu Gln Glu Met Phe Gly Ser Met Phe His Ala Glu Thr
755 760 765

Leu Thr Ala Leu
680


Val 


Gln


Thr 


Ala 


Arg 
685 


Ala Arg His 


Arg 


Glu 
690 


Leu 


Glu 


Ala Thr 


Asp 
695 


Ala 


His 


Leu 













<210> 14 
<211> 766 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7478795CD1 

<400> 14 



Met 


Arg 


Leu 


Trp 


Lys 


Ala 


Val 


Val 


Val 


Thr 


Leu Ala Phe 


Met 


Ser 


1 








5 










10 






15 


Val 


Asp 


Ile


Cys 


Val 
20 


Thr 


Thr 


Ala 


Ile


Tyr 
25 


Val Phe Ser 


His 


Leu 
30 


Asp 


Arg 


Ser 


Leu 


Leu 
35 


Glu 


Asp 


Ile


Arg 


His 
40 


Phe Asn Ile


Phe 


Asp 
45 


Ser 


Val 


Leu 


Asp 


Leu 
50 


Trp 


Ala 


Ala 


Cys 


Leu 
55 


Tyr Arg Ser 


Cys 


Leu 
60 


Leu 


Leu 


Gly 


Ala 


Thr 
65 


Ile


Gly 


Val 


Ala 


Lys 
70 


Asn Ser Ala 


Leu 


Gly 
75 


Pro 


Arg 


Arg 


Leu 


Arg 
80 


Ala 


Ser 


Trp 


Leu 


Val 
85 


Ile Thr Leu


Val 


Cys 
90 


Leu 


Phe 


Val 


Gly 


Ile
95 


Tyr 


Ala 


Met 


Val 


Lys 
100 


Leu Leu Leu 


Phe 


Ser 
105 


Glu 


Val 


Arg 


Arg 


Pro 
110 


Ile


Arg 


Asp 


Pro 


Trp 
115 


Phe Trp Ala 


Leu 


Phe 
120 


Val 


Trp 


Thr 


Tyr 


Ile
125 


Ser 


Leu 


Gly 


Ala 


Ser 
130 


Phe Leu Leu 


Trp 


Trp 
135 


Leu 


Leu 


Ser 


Thr 


Val 
140 


Arg 


Pro 


Gly 


Thr 


Gln
145 


Ala Leu Glu 


Pro 


Gly 
150 


Ala 


Ala 


Thr 


Glu 


Ala 
155 


Glu 


Gly 


Phe 


Pro 


Gly 
160 


Ser Gly Arg 


Pro 


Pro 
165 


Pro 


Glu 


Gln


Ala 


Ser 
170 


Gly 


Ala 


Thr 


Leu 


Gln
175 


Lys Leu Leu 


Ser 


Tyr 
180 


Thr 


Lys 


Pro 


Asp 


Val 
185 


Ala 


Phe 


Leu 


Val 


Ala 
190 


Ala Ser Phe 


Phe 


Leu 
195 


Ile


Val 


Ala 


Ala 


Leu 
200 


Gly 


Glu 


Thr 


Phe 


Leu 
205 


Pro Tyr Tyr 


Thr 


Gly 
210 


Arg 


Ala 


Ile


Asp 


Gly 
215 


Ile


Val


Ile


Gln


Lys 
220 


Ser Met Asp 


Gln


Phe 

225 


Ser 


Thr 


Ala 


Val 


Val 
230 


Ile


Val 


Cys 


Leu 


Leu 
235 


Ala Ile Gly


Ser 


Ser 
240 
Phe Ala


Ala 


Gly 


Ile


Arg 


Gly 


Gly 


Ile


Phe 


Thr Leu 


Ile


Phe 


Ala 








245 










250 








255 


Arg Leu 


Asn 


Ile


Arg 


Leu 


Arg 


Asn 


Cys 


Leu 


Phe Arg 


Ser 


Leu 


Val 








260 










265 








270 


Ser Gln


Glu 


Thr 


Ser 


Phe 


Phe 


Asp 


Glu 


Asn 


Arg Thr 


Gly 


Asp 


Leu 








275 










280 








285 


Ile Ser


Arg 


Leu 


Thr 


Ser 


Asp 


Thr 


Thr 


Met 


Val Ser 


Asp 


Leu 


Val 








290 










295 








300 


Ser Gln


Asn 


Ile


Asn 


Val 


Phe 


Leu 


Arg 


Asn 


Thr Val 


Lys 


Val 


Thr 








305 










310 








315 


Gly Val 


Val 


Val 


Phe 


Met 


Phe 


Ser 


Leu 


Ser 


Trp Gln


Leu 


Ser 


Leu 






320 










325 








330 


Val Thr 


Phe 


Met 


Gly 


Phe 


Pro 


Ile


Ile


Met 


Met Val 


Ser 


Asn 


Ile








335 










340 








345 


Tyr Gly


Lys


Tyr 


Tyr 


Lys 


Arg 


Leu 


Ser 


Lys 


Glu Val 


Gln


Asn 


Ala 






350 










355 








360 


Leu Ala 


Arg


Ala 


Ser 


Asn 


Thr 


Ala 


Glu 


Glu 


Thr Ile


Ser 


Ala 


Met 






365 










370 








375 


Lys Thr 


Val 


Arg 


Ser 


Phe 


Ala 


Asn 


Glu 


Glu 


Glu Glu 


Ala 


Glu 


Val 






380 










385 








390 


Tyr Leu 


Arg


Lys


Leu


Gln


Gln


Val 


Tyr 


Lys 


Leu Asn 


Arg 


Lys 


Glu 






395 










400 








405 


Ala Ala 


Ala 


Tyr


Met


Tyr


Tyr


Val 


Trp 


Gly 


Ser Gly 


Leu 


Thr 


Leu 






410 










415 








420 


Leu Val 


Val 


Gln


Val 


Ser 


Ile


Leu 


Tyr 


Tyr 


Gly Gly 


His 


Leu 


Val 








425 










430 








435 


Ile Ser


Gly 


Gln


Met 


Thr 


Ser 


Gly 


Asn 


Leu 


Ile Ala


Phe


Ile


Ile








440 










445 








450 


Tyr Glu 


Phe 


Val 


Leu 


Gly 


Asp 


Cys 


Met 


Glu 


Ser Val 


Gly 


Ser 


Val 






455 










460 








465 


Tyr Ser 


Gly 


Leu 


Met 


Gln


Gly 


Val 


Gly 


Ala 


Ala Glu 


Lys 


Val 


Phe 




470 










475 








480 


Glu Phe 


Ile


Asp 


Arg 


Gln


Pro 


Thr 


Met 


Val 


His Asp 


Gly 


Ser 


Leu 








485 










490 








495 


Ala Pro 


Asp 


His 


Leu 


Glu 


Gly 


Arg 


Val 


Asp 


Phe Glu 


Asn 


Val 


Thr 






500 










505 








510 


Phe Thr 


Tyr


Arg


Thr 


Arg 


Pro 


His 


Thr 


Gln


Val Leu 


Gln


Asn 


Val 






515 










520 








525 


Ser Phe 


Ser 


Leu 


Ser 


Pro 


Gly


Lys


Val 


Thr 


Ala Leu 


Val 


Gly 


Pro 








530 










535 








540 


Ser Gly 


Ser 


Gly


Lys 


Ser 


Ser 


Cys


Val 


Asn 


Ile Leu


Glu 


Asn 


Phe 






545 










550 








555 


Tyr Pro 


Leu 


Glu 


Gly


Gly


Arg


Val 


Leu 


Leu 


Asp Gly 


Lys 


Pro 


Ile






560 










565 








570 


Ser Ala 


Tyr 


Asp 


His 


Lys 


Tyr 


Leu 


His 


Arg 


Val Ile


Ser 


Leu 


Val 








575 










580 








585 


Ser Gln


Glu 


Pro 


Val 


Leu 


Phe 


Ala 


Arg 


Ser 


Ile Thr


Asp 


Asn 


Ile








590 










595 








600 


Ser Tyr 


Gly 


Leu 


Pro 


Thr 


Val 


Pro 


Phe 


Glu 


Met Val 


Val 


Glu 


Ala 






605 










610 








615 


Ala Gln


Lys 


Ala 


Asn 


Ala 


His 


Gly 


Phe 


Ile


Met Glu 


Leu 


Gln


Asp 








620 










625 








630 


Gly Tyr 


Ser 


Thr 


Glu 


Thr 


Gly 


Glu 


Lys 


Gly 


Ala Gln


Leu 


Ser 


Gly 






635 










640 








645 


Gly Gln


Lys


Gln


Arg 


Val 


Ala 


Met 


Ala 


Arg 


Ala Leu 


Val 


Arg 


Asn 






650 










655 








660 


Pro Pro 


Val 


Leu 


Ile


Leu 


Asp 


Glu 


Ala 


Thr 


Ser Ala 


Leu 


Asp 


Ala 








665 










670 








675 


Glu Ser 


Glu 


Tyr 


Leu 


Ile


Gln


Gln


Ala


Ile


His Gly 


Asn 


Leu 


Gln






680 










685 








690 


Lys His 


Thr 


Val 


Leu 


Ile


Ile


Ala 


His 


Arg 


Leu Ser 


Thr 


Val 


Glu 






695 










700 








705 


His Ala 


His 


Leu 


Ile


Val 


Val 


Leu 


Asp 


Lys 


Gly Arg 


Val 


Val 


Gln
710 715 720

Gln Gly Thr His Gln Gln Leu Leu Ala Gln Gly Gly Leu Tyr Ala

725 730 735

Lys Leu Val Gln Arg Gln Met Leu Gly Leu Gln Pro Ala Ala Asp

740 745 750 

Phe Thr Ala Gly His Asn Glu Pro Val Ala Asn Gly Ser His Lys 

755 760 765 

Ala 



<210> 15 
<211> 450 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 656293CD1 

<400> 15 

Met Gly Leu Arg Ser His His Leu Ser Leu Gly Leu Leu Leu Leu 
1 5 10 15

Phe Leu Leu Pro Ala Glu Cys Leu Gly Ala Glu Gly Arg Leu Ala 
20 25 30 

Leu Lys Leu Phe Arg Asp Leu Phe Ala Asn Tyr Thr Ser Ala Leu 
35 40 45 

Arg Pro Val Ala Asp Thr Asp Gln Thr Leu Asn Val Thr Leu Glu
50 55 60

Val Thr Leu Ser Gln Ile Ile Asp Met Asp Glu Arg Asn Gln Val
65 70 75

Leu Thr Leu Tyr Leu Trp Ile Arg Gln Glu Trp Thr Asp Ala Tyr
80 85 90

Leu Arg Trp Asp Pro Asn Ala Tyr Gly Gly Leu Asp Ala Ile Arg
95 100 105

Ile Pro Ser Ser Leu Val Trp Arg Pro Asp Ile Val Leu Tyr Asn

110 115 120 

Lys Ala Asp Ala Gln Pro Pro Gly Ser Ala Ser Thr Asn Val Val

125 130 135 

Leu Arg His Asp Gly Ala Val Arg Trp Asp Ala Pro Ala Ile Thr

140 145 150 

Arg Ser Ser Cys Arg Val Asp Val Ala Ala Phe Pro Phe Asp Ala 

155 160 165 

Gln His Cys Gly Leu Thr Phe Gly Ser Trp Thr His Gly Gly His

170 175 180 

Gln Leu Asp Val Arg Pro Arg Gly Ala Ala Ala Ser Leu Ala Asp

185 190 195 

Phe Val Glu Asn Val Glu Trp Arg Val Leu Gly Met Pro Ala Arg 

200 205 210 

Arg Arg Val Leu Thr Tyr Gly Cys Cys Ser Glu Pro Tyr Pro Asp 

215 220 225 

Val Thr Phe Thr Leu Leu Leu Arg Arg Arg Ala Ala Ala Tyr Val 

230 235 240 

Cys Asn Leu Leu Leu Pro Cys Val Leu Ile Ser Leu Leu Ala Pro

245 250 255 

Leu Ala Phe His Leu Pro Ala Asp Ser Gly Glu Lys Val Ser Leu 

260 265 270 

Gly Val Thr Val Leu Leu Ala Leu Thr Val Phe Gln Leu Leu Leu

275 280 285 

Ala Glu Ser Met Pro Pro Ala Glu Ser Val Pro Leu Ile Gly Lys

290 295 300 

Tyr Tyr Met Ala Thr Met Thr Met Val Thr Phe Ser Thr Ala Leu 

305 310 315 

Thr Ile Leu Ile Met Asn Leu His Tyr Cys Gly Pro Ser Val Arg
320



Pro 


Val 


Pro 


Ala Trp 


Ala 


Arg 


Ala Leu 








335 








Arg 


Gly 


Leu 


Cys Val 


Arg 


Glu 


Arg Gly 








350 








Arg 


Pro 


Pro 


Glu Leu 


Ser 


Pro 


Ser Pro 








365 








Ala 


Gly 


Pro 


Pro Ala 


Gly 


Pro 


Cys His 








380 








Arg 


Gln


Glu 


Ala Leu 


Leu 


His 


His Val 








395 








Phe 


Arg 


Ser 


His Arg 


Ala 


Ala 


Gln Arg








410 








Arg 


Leu 


Ala 


Arg Val 


Met 


Asp 


Arg Phe 








425 








Ser 


Met 


Ala 


Leu Val 


Met 


Ser 


Leu Leu 



440 



<210> 16 
<211> 260 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7473957CD1 

<400> 16 



Met 


Pro 


Ile


Leu 


Ala 


Asn 


Leu 


Pro Gly 


1 








5 








Met 


Glu 


Phe 


Thr 


Ser 
20 


Ser 


Gly 


Ser Ala 


Val 


Thr 


Gly 


Ser 


Leu 
35 


Glu 


Thr 


Lys Tyr 


Leu 


Thr 


Phe 


Thr 


Glu 
50 


Lys 


Trp 


Asn Thr 


Glu 


Ile


Thr 


Val 


Glu 
65 


Asp 


Gln


Leu Ala 


Phe 


Asp 


Ser 


Ser 


Phe 
80 


Ser 


Pro 


Asn Thr 


Ile


Lys 


Thr 


Gly 


Tyr 
95 


Lys 


Arg 


Glu His 


Met 


Asp 


Phe 


Asp 


Ile
110 


Ala 


Gly 


Pro Ser 


Leu 


Gly 


Tyr 


Glu 


Gly 
125 


Trp 


Leu 


Ala Gly 


Thr 


Ala 


Lys 


Ser 


Arg 
140 


Val 


Thr 


Gln Ser


Lys 


Thr 


Asp 


Glu 


Phe 
155 


Gln


Leu 


His Thr 


Glu 


Phe 


Gly 


Gly 


Ser 
170 


Ile


Tyr


Gln Lys


Thr 


Ala 


Val 


Asn 


Leu 
185 


Ala 


Trp 


Thr Ala 


Phe 


Gly 


Ile


Ala 


Ala 
200 


Lys 


Tyr 


Gln Ile


Ser 


Ala 


Lys 


Val 


Asn 
215 


Asn 


Ser 


Ser Leu 


Gln


Thr 


Leu 


Lys 


Pro 
230 


Gly 


Ile


Lys Leu 


Asp 


Gly 


Lys 


Asn 


Val 


Asn 


Ala 


Gly Gly 



245 
325




330 


Leu 


Leu Gly His Leu 


Ala 


340 




345 


Glu 


Pro Cys Gly Gln


Ser 


355 




360 


Gln


Ser Pro Glu Gly 


Gly 


370 




375 


Glu 


Pro Arg Cys Leu 


Cys 


385 




390 


Ala 


Thr Ile Ala Asn


Thr 


400 




405 


Cys 


His Glu Asp Trp 


Lys 


415 




420 


Phe 


Leu Ala Ile Phe


Phe 


430 




435 


Val 


Leu Val Gln Ala


Leu 


445 




450 



Met 


Ser Ser Pro Arg 


Ala 


10 




15 


Asn 


Thr Glu Thr Thr 


Lys 


25 




30 


Arg 


Trp Thr Glu Tyr 


Gly 


40 




45 


Asp 


Asn Thr Leu Gly 


Thr 


55 




60 


Arg 


Gly Leu Lys Leu 


Thr 


70 




75 


Gly 


Lys Lys Asn Ala


Lys 


85 




90 


Ile


Asn Leu Gly Cys 


Asp 


100 




105 


Ile


Arg Gly Ala Leu 


Val 


115 




120 


Tyr 


Gln Met Asn Phe


Glu 


130 




135 


Asn 


Phe Ala Val Gly 


Tyr 


145 




150 


Asn 


Val Asn Asp Gly 


Thr 


160 




165 


Val 


Asn Lys Lys Leu 


Glu 


175 




180 


Gly 


Asn Ser Asn Thr 


Arg 


190 




195 


Asp 


Pro Asp Ala Cys 


Phe 


205 




210 


Ile


Gly Leu Gly Tyr 


Thr 


220 




225 


Thr 


Leu Ser Ala Leu 


Leu 


235 




240 


His 


Lys Leu Gly Leu 


Gly 


250 




255 
Leu Glu Phe Gln Ala
260 

<210> 17 

<211> 506 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7474111CD1 

<400> 17 



Met 


Ser


Glu 


Pro 


Glu 


Leu 


Gly Ser Gly Gln Phe Leu Glu Lys Ala


1 








5 




10 15 


Leu 


Gln


Thr 


Pro 


Ser 
20 


Val 


Pro Ala Pro Glu Ser Thr Leu Gly Phe 
25 30 


Glu 


Pro 


Gly


Leu 


Leu 
35 


Lys 


Gly Ala Leu Gly Thr Ala Gln Phe Ile
40 45 


Pro 


Met 


Ala 


Gln


Gly
50 


Arg 


Thr Arg Glu Gln Ala Ser Arg Arg Trp
55 60 


Ala 


Pro 


Arg 


Ser 


Pro 
65 


Ala 


Leu Arg Thr Pro Pro Arg His Tyr Gly 
70 75


Pro


Glu 






Gly
80 


Arg 


Thr Ala Ser Arg Gly Gly Glu Pro Glu
85 90 


Val 


Gln


Gly


Gly


Ala 
95 


Pro 


Gly Asn Pro Ser Pro Ser Lys Pro Gly
100 105




Pro 


Gln


Gly


Val 
110 


Gly


Pro Ala Ala Tm Glu Arg Ala Pro Arg
115 120 


Pro 


Arg 




Ala 


Gln
125 


Pro 


Ser Gly Ala Arg Val Gly Glu Arg Thr
130 135 


Gln


Pro 


Arg 


Ser 


Gln
140 


Pro 


Val Gly Leu Ser Arg Gly Ala Gly Glu 
145 150


Asp 


Ser


Pro 


Ala 


Thr 
155 


Arg 


Ser Gly Ala Ala Ser Val Val Leu Asn 
160 165 


Val 


Gly


Gly


Ala 


Arg 
170 


Tyr


Ser Leu Ser Arg Glu Leu Leu Lys Asp
175 180


Phe 


Pro 


Leu 


Arg 


Arg 
185 


Val 


Ser Arg Leu His Gly Cys Arg Ser Glu 
190 195 


Arg 


Asp 


Val 


Leu 


Glu 
200 


Val 


Cys Asp Asp Tyr Asp Arg Glu Arg Asn 
205 210 


Glu 


Tyr


Phe 


Phe 


Asp 
215 


Arg 


His Ser Glu Ala Phe Gly Phe Ile Leu
220 225 


Leu 


Tyr 


Ala 


Ala 


Pro 
230 


Ser 


Arg Arg Trp Leu Glu Arg Met Arg Arg 
235 240 


Thr 


Phe 


Glu 


Glu 


Pro 
245 


Thr 


Ser Ser Leu Ala Ala Gln Ile Leu Ala
250 255 


Ser 


Val 


Ser 


Val 


Val 
260 


Phe 


Val Ile Val Ser Met Val Val Leu Cys
265 270 


Ala 


Ser 


Thr 


Leu 


Pro 
275 


Asp 


Trp Arg Asn Ala Ala Ala Asp Asn Arg 
280 285 


Ser 


Leu 


Asp 


Asp 


Arg 
290 


Ser 


Arg Ile Ile Glu Ala Ile Cys Ile Gly
295 300 


Trp 


Phe 


Thr 


Ala 


Glu 
305 


Cys 


Ile Val Arg Phe Ile Val Ser Lys Asn
310 315 


Lys 


Cys 


Glu 


Phe 


Val 
320 


Lys 


Arg Pro Leu Asn Ile Ile Asp Leu Leu
325 330 


Ala 


Ile


Thr 


Pro 


Tyr 
335 


Tyr 


Ile Ser Val Leu Met Thr Val Phe Thr
340 345 


Gly 


Glu 


Asn 


Ser 


Gln
350


Leu


Gln Arg Ala Gly Val Thr Leu Arg Val
355 360 


Leu 


Arg 


Met 


Met 


Arg 
365 


Ile


Phe Trp Val Ile Lys Leu Ala Arg His
370 375
Phe


Ile


Gly Leu 


Gln


Thr Leu Gly 


Leu 


Thr 


Leu Lys 


Arg Cys 


Tyr 








380 






385 






390 


Arg 


Glu 


Met Val 


Met 


Leu Leu Val 


Phe 


Ile


Cys Val 


Ala Met 


Ala 








395 






400 






405 


Ile


Phe 


Ser Ala 


Leu 


Ser Gln Leu


Leu 


Glu 


His Gly 


Leu Asp 


Leu 








410 






415 






420 


Glu 


Thr 


Ser Asn 


Lys 


Asp Phe Thr 


Ser 


Ile


Pro Ala 


Ala Cys 


Trp 








425 






430 






435 


Trp 


Val 


Ile Ile


Ser 


Met Thr Thr 


Val 


Gly 


Tyr Gly 


Asp Met 


Tyr 








440 






445 






450 


Pro 


Ile


Thr Val 


Pro 


Gly Arg Ile


Leu Gly 


Gly Val 


Cys Val 


Val 








455 






460 






465 


Ser 


Gly 


Ile Val


Leu 


Leu Ala Leu 


Pro 


Ile


Thr Phe


Ile Tyr


His 








470 






475 






480 


Ser 


Phe 


Val Gln


Cys 


Tyr His Glu 


Leu Lys 


Phe Arg 


Ser Ala 


Arg 








485 






490 






495 


Tyr 


Ser 


Arg Ser 


Leu 


Ser Thr Glu 


Phe 


Leu 


Asn 












500 






505 









<210> 18 

<211> 506 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7480826CD1 

<400> 18 



Met 


Lys 


Lys 


Ala 


Glu 


Met 


Gly 


Arg 


Phe Ser 


Ile Ser


Pro Asp 


Glu 


1 








5 








10 






15 


Asp 


Ser 


Ser 


Ser 


Tyr 
20 


Ser 


Ser 


Asn 


Ser Asp 
25 


Phe Asn 


Tyr Ser 


Tyr 
30 


Pro 


Thr 


Lys 


Gln


Ala 
35 


Ala 


Leu 


Lys 


Ser His 
40 


Tyr Ala 


Asp Val 


Asp 
45 


Pro 


Glu 


Asn 


Gln


Asn 
50 


Phe 


Leu 


Leu 


Glu Ser 
55 


Asn Leu 


Gly Lys 


Lys 
60 


Lys 


Tyr 


Glu 


Thr 


Glu 


Phe 


His 


Pro 


Gly Thr 


Thr Ser 


Phe Gly 


Met 






65 








70 






75 


Ser 


Val 


Phe 


Asn 


Leu 
80 


Ser 


Asn 


Ala 


Ile Val
85 


Gly Ser 


Gly Ile


Leu 
90 


Gly 


Leu 


Ser 


Tyr 


Ala 
95 


Met 


Ala 


Asn 


Thr Gly 
100 


Ile Ala


Leu Phe


Ile
105


Ile


Leu 


Leu 


Thr 


Phe 
110 


Val 


Ser 


Ile


Phe Ser 
115 


Leu Tyr 


Ser Val 


His 
120 


Leu 


Leu 


Leu 


Lys 


Thr 
125 


Ala 


Asn 


Glu 


Gly Gly 
130 


Ser Leu 


Leu Tyr 


Glu 
135 


Gln


Leu 


Gly 


Tyr 


Lys 
140 


Ala 


Phe 


Gly 


Leu Val 
145 


Gly Lys 


Leu Ala 


Ala 
150 


Ser 


Gly 


Ser 


Ile


Thr 
155 


Met 


Gln


Asn


Ile Gly
160 


Ala Met 


Ser Ser 


Tyr 
165 


Leu 


Phe 


Ile


Val 


Lys 
170 


Tyr 


Glu 


Leu 


Pro Leu 
175 


Val Ile


Gln Ala


Leu 
180 


Thr 


Asn 


Ile


Glu 


Asp 
185 


Lys 


Thr 


Gly 


Leu Trp 
190 


Tyr Leu 


Asn Gly 


Asn 
195 


Tyr 


Leu 


Val 


Leu 


Leu 
200 


Val 


Ser 


Leu 


Val Val 
205 


Ile Leu


Pro Leu 


Ser 
210 


Leu 


Phe 


Arg 


Asn 


Leu 
215 


Gly 


Tyr 


Leu 


Gly Tyr 
220 


Thr Ser 


Gly Leu 


Ser 
225 


Leu 


Leu 


Cys 


Met 


Val 
230 


Phe 


Phe 


Leu 


Ile Val
235


Val Ile


Cys Lys 


Lys 
240 


Phe 


Gln


Val 


Pro 


Cys 
245 


Pro 


Val 


Glu 


Ala Ala 
250 


Leu Ile


Ile Asn


Glu 
255 
Thr


Ile


Asn 


Thr 


Thr 


Leu 


Thr 


Gln Pro Thr Ala Leu Val Pro


Ala 










260 






265 


270 


Leu 


Ser 


His 


Asn 


Val 


Thr 


Glu 


Asn Asp Ser Cys Arg Pro His 


Tyr 










275 






280 


285 


Phe 


Ile


Phe 


Asn 


Ser 


Gln


Thr 


Val Tyr Ala Val Pro Ile Leu


Ile










290 






295 


300 


Phe 


Ser 


Phe Val 


Cys 


His 


Pro 


Ala Val Leu Pro Ile Tyr Glu


Glu 










305 






310 


315 


Leu 


Lys


Asp Arg 


Ser 


Arg 


Arg 


Arg Met Met Asn Val Ser Lys 


He 








320 






325 


330 


Ser 


Phe 


Phe Ala 


Met 


Phe 


Leu 


Met Tyr Leu Leu Ala Ala Leu 


Phe 










335 






340 


345 


Gly 


Tyr 


Leu 


Thr 


Phe 


Tyr 


Glu 


His Val Glu Ser Glu Leu Leu 


His 






350 






355 


360 


Thr 


Tyr


Ser 


Ser 


He 


Leu 


Gly 


Thr Asp Ile Leu Leu Leu Ile


Val 








365 






370 


375 


Arg 


Leu 


Ala Val 


Leu 


Met 


Ala 


Val Thr Leu Thr Val Pro Val 


Val 








380 






385 


390 


He 


Phe 


Pro 


He 


Arg 


Ser 


Ser 


Val Thr His Leu Leu Cys Ala 


Ser 










395 






400 


405 


Lys 


Asp 


Phe 


Ser 


Trp 


Trp 


Arg 


His Ser Leu Ile Thr Val Ser


He 






410 






415 


420 


Leu 


Ala 


Phe 


Thr 


Asn 


Leu 


Leu 


Val Ile Phe Val Pro Thr Ile


Arg 










425 






430 


435 


Asp 


He 


Phe 


Gly 


Phe 


He 


Gly 


Ala Ser Ala Ala Ser Met Leu 


He 








440 






445 


450 


Phe 


He 


Leu 


Pro 


Ser 


Ala 


Phe 


Tyr Ile Lys Leu Val Lys Lys


Glu 










455 






460 


465 


Pro 


Met 


Lys 


Ser 


Val 


Gin 


Lys 


Ile Gly Ala Leu Phe Phe Leu


Leu 










470 






475 


480 


Ser 


Gly 


Val 


Leu 


Val 


Met 


Thr 


Gly Ser Met Ala Leu Ile Val


Leu 








485 






490 


495 


Asp 


Trp 


Val 


His 


Asn 


Ala 


Pro 


Gly Gly Gly His 












500 






505 





<210> 19 

<211> 315 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6025572CD1 

<400> 19 



Met 


His 


Arg 


Glu 


Pro 


Ala 


Lys 


Lys Lys Ala Glu Lys Arg Leu 


Phe 


1 








5 






10 


15 


Asp 


Ala 


Ser 


Ser 


Phe 


Gly 


Lys 


Asp Leu Leu Ala Gly Gly Val 


Ala 








20 






25 


30 


Ala 


Ala 


Val 


Ser 


Lys 


Thr 


Ala 


Val Ala Pro Ile Glu Arg Val


Lys 










35 






40 


45 


Leu 


Leu 


Leu 


Gin 


Val 


Gin 


Ala 


Ser Ser Lys Gln Ile Ser Pro


Glu 










50 






55 


60 


Ala 


Arg 


Tyr 


Lys 


Gly 


Met 


Val 


Asp Cys Leu Val Arg Ile Pro


Arg 










65 






70 


75 


Glu 


Gin 


Gly 


Phe 


Phe 


Ser 


Phe 


Trp Arg Gly Asn Leu Ala Asn 


Val 








80 






85 


90 


He 


Arg 


Tyr 


Phe 


Pro 


Thr 


Gin 


Ala Leu Asn Phe Ala Phe Lys 


Asp 








95 






100 


105 


Lys 


Tyr 


Lys 


Gin 


Leu 


Phe 


Met 


Ser Gly Val Asn Lys Glu Lys 


Gin 










110 






115 


120 


Phe 


Trp 


Arg 


Trp 


Phe 


Leu 


Ala 


Asn Leu Ala Ser Gly Gly Ala 


Ala 




125 






130 


135 
Gly


Ala 


Thr 


Ser Leu 
140 


Cys 


Val 


Val 


Thr 


Arg 


Leu 


Gly Val 
155 


Asp 


He 


Gly 


Phe 


Lys 


Gly 


Leu Gly 
170 


Asp 


Cys 


He 


Gly 


He 


Ala 


Gly Leu 
185 


Tyr 


Gin 


Gly 


Ile


He 


Val 


Tyr Arg 
200 


Ala 


Ser 


Tyr 


Lys


Gly 


Leu 


Leu Pro 
215 


Lys 


Pro 


Lys 


Phe 


Phe 


He 


Ala Gin 
230 


Val 


Val 


Thr 


Tyr 


Pro 


Phe 


Asp Thr 
245 


Val 


Arg 


Arg 


Glu 


Ala 


Lys 


Arg Gin 
260 


Tyr 


Lys 


Gly 


Ile


Tyr 


Gin 


His Glu 
275 


Gly 


He 


Ser 


Ser 


Asn 


Val 


Leu Arg 
290 


Gly 


Thr 


Gly 


Tyr 


Asp 


Lys 


He Lys 
305 


Glu 


Phe 


Phe 



<210> 20 
<211> 540 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5686561CD1 

<400> 20 



Met 


Val 


Pro 


Ala 


Gly 


Trp 


Val 


Arg 


1 








5 








Gly 


Gly 


Asp 


Pro 


Val 
20 


Val 


Pro 


Trp 


Gin 


Asp 


Asp 


Gly 


Gin 
35 


Asp 


Arg 


Glu 


Leu 


Pro 


Glu 


Ser 


Leu 
50 


Thr 


Ser 


Leu 


Asn


Asn 


Pro 


Asp 


Val 
65 


Met 


He 


Pro 


Tyr 


Ala 


He 


Phe 


Phe 
80 


He 


Val 


Phe 


Leu 


Met 


Asn 


Leu 


Leu 
95 


Thr 


Ala 


He 


Tyr 


Leu 


Met 


Lys 


Ser 
110 


Leu 


Gin 


Thr 


Gly 


Thr 


Arg 


Ala 


Ala 


Phe 


Glu 


Val 








125 








Gly 


Gly 


Ala 


Phe 


Pro 


Gin 


Ala 


Val 








140 








Leu 


Gin 


Val 


Leu 


Gin 
155 


Lys 


Val 


Gin 


Ala 


Met 


Met 


Glu 


Lys 
170 


Val 


Arg 


Ser 


Ala 


Glu 


Glu 


Phe 


Gin 
185 


Lys 


Leu 


Phe 


Val 


Lys 


Glu 


His 


Pro 


Pro 


Arg 


Pro 



Tyr Pro 


Leu Asp Phe Ala 


Arg 


145 




150 


Lys Gly 


Pro Glu Glu Arg 


Gin 


160 




165 


Met Lys 


He Ala Lys Ser 


Asp 


175 




180 


Phe Gly 


Val Ser Val Gin 


Gly 


190 




195 


Phe Gly 


Ala Tyr Asp Thr 


Val 


205 




210 


Lys Thr 


Pro Phe Leu Val 


Ser 


220 




225 


Thr Cys 


Ser Gly He Leu 


Ser 


235 




240 


Arg Met 


Met Met Gin Ser 


Gly 


250 




255 


Thr Leu 


Asp Cys Phe Val 


Lys 


265 




270 


Ser Phe 


Phe Arg Gly Ala 


Phe 


280 




285 


Gly Ala 


Leu Val Leu Val 


Leu 


295 




300 


His He 


Asp He Gly Gly 


Arg 


310 




315 


Gly Leu 


Glu Leu Ser Leu 


Trp 


10 




15 


Ser Cys


Arg Phe Cys Ser 


Gin 


25 




30 


Arg Leu 


Thr Tyr Phe Gin 


Asn 


40 




45 


Leu Val 


Leu Leu Thr Thr 


Ala 


55 




60 


Ala Tyr 


Ser Lys Asn Arg 


Ala 


70 




75 


Thr Val 


He Gly Ser Leu 


Phe 


85 




90 


He Tyr 


Ser Gin Phe Arg 


Gly 


100 




105 


Ser Leu 


Phe Arg Arg Arg 


Leu 


115 




120 


Leu Ser 


Ser Met Val Gly 


Glu 


130 




135 


Gly Val 


Lys Pro Gin Asn 


Leu 


145 




150 


Leu Asp 


Ser Ser His Lys 


Gin 


160 




165 


Tyr Asp 


Ser Val Leu Leu 


Ser 


175 




180 


Asn Glu 


Leu Asp Arg Ser 


Val 


190 




195 


Glu Tyr 


Gin Ser Pro Phe 


Leu 
200








205 






210 


Gin 


Ser 


Ala Gin 


Phe 
215 


Leu 


Phe 


Gly 


His Tyr 
220 


Tyr 


Phe Asp Tyr 


Leu 
225 


Gly 


Asn 


Leu He 


Ala 
230 


Leu 


Ala 


Asn 


Leu Val 
235 


Ser 


He Cys Val 


Phe 
240 


Leu 


Val 


Leu Asp 


Ala 
245 


Asp 


Val 


Leu 


Pro Ala 
250 


Glu 


Arg Asp Asp 


Phe 
255 


He 


Leu 


Gly He 


Leu 
260 


Asn 


Cys 


Val 


Phe He 
265 


Val 


Tyr Tyr Leu 


Leu 
270 


Glu 


Met 


Leu Leu 


Lys 
275 


Val 


Phe 


Ala 


Leu Gly 
280 


Leu 


Arg Gly Tyr 


Leu 
285 


Ser 


Tyr 


Pro Ser 


Asn 
290 


Val 


Phe 


Asp 


Gly Leu 
295 


Leu 


Thr Val Val 


Leu 
300 


Leu 


Val 


Leu Glu 


He 
305 


Ser 


Thr 


Leu 


Ala Val 
310 


Tyr 


Arg Leu Pro 


His 
315 


Pro 


Gly 


Trp Arg 


Pro 
320 


Glu 


Met 


Val 


Gly Leu 
325 


Leu 


Ser Leu Trp 


Asp 
330 


Met 


Thr 


Arg Met 


Leu 
335 


Asn 


Met 


Leu 


He Val 
340 


Phe 


Arg Phe Leu 


Arg 
345 


He 


He 


Pro Ser 


Met 
350 


Lys 


Pro 


Met 


Ala Val 
355 


Val 


Ala Ser Thr 


Val 
360 


Leu 


Gly 


Leu Val 


Gin 
365 


Asn 


Met 


Arg 


Ala Phe 
370 


Gly 


Gly He Leu 


Val 
375 


Val 


Val 


Tyr Tyr 


Val 
380 


Phe 


Ala 


He 


He Gly 
385 


Ile


Asn Leu Phe 


Arg 
390 


Gly 


Val 


He Val 


Ala 
395 


Leu 


Pro 


Gly 


Asn Ser 
400 


Ser 


Leu Ala Pro 


Ala 
405 


Asn 


Gly 


Ser Ala 


Pro 
410 


Cys 


Gly 


Ser 


Phe Glu 
415 


Gin 


Leu Glu Tyr 


Trp 
420 


Ala 


Asn 


Asn Phe 


Asp 
425 


Asp 


Phe 


Ala 


Ala Ala 
430 


Leu 


Val Thr Leu 


Trp 
435 


Asn 


Leu 


Met Val 


Val 
440 


Asn 


Asn 


Trp 


Gin Val 
445 


Phe 


Leu Asp Ala 


Tyr 
450 


Arg 


Arg 


Tyr Ser 


Gly 
455 


Pro 


Trp 


Ser 


Lys He 
460 


Tyr 


Phe Val Leu 


Trp 
465 


Trp 


Leu 


Val Ser 


Ser 
470 


Val 


He 


Trp 


Val Asn 
475 


Leu 


Phe Leu Ala 


Leu 
480 


He 


Leu 


Glu Asn 


Phe 
485 


Leu 


His 


Lys 


Trp Asp 
490 


Pro 


Arg Ser His 


Leu 
495 


Gin 


Pro 


Leu Ala 


Gly 
500 


Thr 


Pro 


Glu 


Ala Thr 
505 


Tyr 


Gin Met Thr 


Val 
510 


Glu 


Leu 


Leu Phe 


Arg 
515 


Asp 


He 


Leu 


Glu Glu 
520 


Pro 


Gly Glu Asp 


Glu 
525 


Leu 


Thr 


Glu Arg 


Leu 
530 


Ser 


Gin 


His 


Pro His 
535 


Leu 


Trp Leu Cys 


Arg 
540 



<210> 21 

<211> 322 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1553725CD1 

<400> 21 

Met Glu Ala Asp Leu Ser Gly Phe Asn Ile Asp Ala Pro Arg Trp
1 5 10 15

Asp Gln Arg Thr Phe Leu Gly Arg Val Lys His Phe Leu Asn Ile

20 25 30 

Thr Asp Pro Arg Thr Val Phe Val Ser Glu Arg Glu Leu Asp Trp 

35 40 45 
Ala


Lys 


Val Met 


Val 
50 


Glu 


Lys 


Ser 


Gly 


Thr 


Gin Val 


Glu 
65 


Gin 


Leu 


Leu 


Ser 


Ala 


Phe His 


Pro 
80 


Asp 


Thr 


Gly 


Arg 


Met 


Ser Phe 


Gin 
95 


Leu 


Pro 


Gly 


Met 


Leu 


Gin Phe 


Tyr 
110 


Arg 


Thr 


Met 


Trp 


Val 


Asn Gin 


Ser 
125 


Phe 


Asn 


Ala 


Asn 


Ala 


Ala Ser 


Pro 
140 


Thr 


Ser 


Val 


Phe 


Thr 


Ala Thr 


Thr 
155 


Thr 


Ala 


Val 


Met 


Leu 


Thr Lys 


Lys 
170 


Ala 


Pro 


Pro 


Phe 


Ala 


Ala Val 


Ala 
185 


Ala 


Ala 


Asn 


Arg 


Gin 


Gin Glu 


Leu 
200 


He 


Lys 


Gly 


Glu 


Asn 


Glu He 


Gly 
215 


His 


Ser 


Arg 


Thr 


Gin 


Val Val 


He 
230 


Ser 


Arg 


He 


He 


Leu 


Leu Pro 


Val 
245 


He 


Met 


Glu 


Met 


Gin 


Lys Val 


Lys 
260 


Val 


Leu 


His 


Ser 


Gly 


Cys Phe 


Leu 
275 


He 


Phe 


Met 


Phe 


Pro 


Gin Lys 


Cys 
290 


Glu 


Leu 


Pro 


Leu 


Gin 


Asp Thr 


He 
305 


Lys 


Ala 


Lys 


Val 


Tyr 


Phe Asn 


Lys 
320 


Gly 


Leu 





<210> 22 
<211> 417 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1695770CD1 

<400> 22 



Met Thr 


Thr 


Leu 


Val 


Pro 


Ala 


Thr 


1 






5 








Thr Leu 


Pro 


Gly 


Gin 


Val 


Leu 


Leu 








20 








Glu Val 


Lys 


Ser 


Gly 


Thr 


Lys 


Gly 








35 








Asp Phe 


Leu 


Asp 


Lys 


Leu 


Met 


Gly 








50 








Arg He 


Arg 


Pro 


Asn 


Phe 


Lys 


Gly 








65 








Asn He 


Phe 


He 


Asn 


Ser 


Phe 


Ser 








80 








Asp Tyr 


Arg 


Val 


Asn 


Val 


Phe 


Leu 



95 



Arg Met 


Gly Val Val 


Pro 


Pro 


55 








60 


Tyr Ala 


Lys Lys 


Leu 


Tyr 


Asp 


70 








75 


Glu Lys 


Met Asn Val 


He 


Gly 


85 








90 


Gly Met 


He He 


Thr 


Gly 


Phe 


100 








105 


Pro Ala 


Val He 


Phe 


Trp 


Gin 


115 








120 


Leu Val 


Asn Tyr Thr 


Asn 


Arg 


130 








135 


Arg Gin 


Met Ala 


Leu 


Ser 


Tyr 


145 








150 


Ala Thr 


Ala Val Gly 


Met 


Asn 


160 








165 


Leu Val 


Gly Arg Trp 


Val 


Pro 


175 








180 


Cys Val 


Asn He 


Pro 


Met 


Met 


190 








195 


He Cys 


Val Lys Asp 


Arg 


Asn 


205 








210 


Arg Ala 


Ala Ala 


He 


Gly 


He 


220 








225 


Thr Met 


Ser Ala 


Pro 


Gly 


Met 


235 








240 


Arg Leu 


Glu Lys 


Leu 


His 


Phe 


250 








255 


Ala Pro 


Leu Gin 


Val 


Met 


Leu 


265 








270 


Val Pro 


Val Ala 


Cys 


Gly 


Leu 


280 








285 


Val Ser 


Tyr Leu 


Glu 


Pro 


Lys 


295 








300 


Tyr Gly 


Glu Leu 


Glu 


Pro 


Tyr 


310 








315 



Leu Ser 


Phe Leu Leu 


Leu 


Trp 


10 






15 


Arg Val 


Ala Leu Ala 


Lys 


Glu 


25 






30 


Ser Gin 


Pro Met Ser 


Pro 


Ser 


40 






45 


Arg Thr 


Ser Gly Tyr 


Asp 


Ala 


55 






60 


Pro Pro 


Val Asn Val 


Thr 


Cys 


70 






75 


Ser Val 


Thr Lys Thr 


Thr 


Met 


85 






90 


Arg Gin 


Gin Trp Asn 


Asp 


Pro 


100 






105 
Arg


Leu 


Ser 


Tyr Arg 
110 


Glu 


Tyr 


Pro 


Pro 


Ser 


Met 


Leu Asp 
125 


Ser 


He 


Trp 


Asn 


Glu 


Lys 


Gly Ala 
140 


Asn 


Phe 


His 


Leu 


Leu 


Arg 


He Phe 
155 


Lys 


Asn 


Gly 


Leu 


Thr 


Leu 


He Leu 
170 


Ser 


Cys 


Leu 


Met 


Asp 


He 


Gin Thr 
185 


Cys 


Thr 


Met 


Thr 


Met 


Lys 


Asp Leu 
200 


Val 


Phe 


Glu 


Val 


Gin 


Val 


Ala Glu 
215 


Gly 


Leu 


Thr 


Asp 


Glu 


Lys 


Asp Leu 
230 


Gly 


Cys 


Cys 


Lys 


Phe 


Thr 


Cys He 
245 


Glu 


Val 


Lys 


Gly 


Tyr 


Tyr 


Leu He 
260 


Gin 


Met 


Tyr 


He 


Leu 


Ser 


Trp Val 
275 


Ser 


Phe 


Trp 


Ala 


Arg 


Val 


Gly Leu 
290 


Gly 


He 


Thr 


Gin 


Ser 


Ser 


Gly Ser 
305 


Arg 


Ala 


Ser 


Lys 


Ala 


He 


Asp He 
320 


Trp 


Met 


Ala 


Ala 


Ala 


Leu 


Leu Glu 
335 


Tyr 


Ala 


Ala 


His 


Lys 


Glu 


Phe He 
350 


Arg 


Leu 


Arg 


Leu 


Glu 


Glu 


Asp He 
365 


He 


Gin 


Glu 


Tyr 


Gly 


Leu 


Gly His 
380 


Cys 


Leu 


Gin 


Glu 


Gly 


Ser 


Gly He 
395 


Tyr 


Ser 


Pro 


Arg 


Glu 


Gly 


Glu Thr 
410 


Thr 


Arg 


Lys 



<210> 23 
<211> 1864 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 4672222CD1 

<400> 23 

Met Ser Gln Lys Ser Trp Ile Glu

1 5 
Cys Val Tyr Ile Ile Pro Ser Ser
20 

Pro Gly Cys Gln Ile Cys Gln Gln
35 

Arg Leu Val Lys Gln His Ala Cys
50 

Lys Tyr Ser Asp Val Lys Leu Gly 
65 



Asp Asp 


Ser Leu 


Asp Leu 


Asp 


115 








120 


Lys Pro 


Asp Leu 


Phe 


Phe 


Ala 


130 








135 


Glu Val 


Thr Thr 


Asp Asn 


Lys 


145 








150 


Asn Val 


Leu Tyr 


Ser 


He 


Arg 


160 








165 


Met Asp 


Leu Lys 


Asn Phe


Pro 


175 








180 


Gin Leu 


Glu Ser 


Phe Gly 


Tyr 


190 








195 


Trp Leu 


Glu Asp 


Ala 


Pro 


Ala 


205 








210 


Leu Pro 


Gin Phe 


He 


Leu 


Arg 


220 








225 


Thr Lys 


His Tyr 


Asn 


Thr 


Gly 


235 








240 


Phe His 


Leu Glu 


Arg 


Gin 


Met 


250 








255 


He Pro 


Ser Leu 


Leu 


He 


Val 


265 








270 


He Asn 


Met Asp 


Ala 


Ala 


Pro 


280 








285 


Thr Val 


Leu Thr 


Met 


Thr 


Thr 


295 








300 


Leu Pro 


Lys Val 


Ser 


Tyr 


Val 


310 








315 


Val Cys 


Leu Leu 


Phe 


Val 


Phe 


325 








330 


He Asn 


Phe Val 


Ser Arg 


Gin 


340 








345 


Arg Arg 


Gin Arg 


Arg 


Gin 


Arg 


355 








360 


Ser Arg 


Phe Tyr 


Phe 


Arg 


Gly 


370 








375 


Ala Arg 


Asp Gly 


Gly Pro 


Met 


385 








390 


Gin Pro 


Pro Ala 


Pro 


Leu 


Leu 


400 








405 


Leu Tyr 


Val Asp 








415 










Ser Thr 


Leu Thr 


Lys 


Arg 


Glu 


10 








15 


Lys Asp 


Pro His 


Arg Cys 


Leu 


25 








30 


Leu Val 


Arg Cys 


Phe Cys 


Gly 


40 








45 


Phe Thr 


Ala Ser 


Leu 


Ala 


Met 


55 








60 


Asp His 


Phe Asn 


Gin Ala 


He 


70 








75 
Glu


Glu 


Trp 


Ser 


Val 
80 


Ala 


Tyr 


Gly 


Val 


He 
95 


Ala 


Lys 


Tyr 


Val 


Arg 
110 


Leu 


Gin 


Leu 


Leu 


Leu 
125 


Val 


He 


Ser 


Val 


His 
140 


Arg 


He 


Lys 


Gin 


Leu 
155 


Thr 


Thr 


Gly 


Ala 


Trp 
170 


Ala 


Lys 


His 


Val 


Gly 
185 


Ser 


Arg 


Lys 


He 


Cys 
200 


Glu 


Asn 


Arg 


Asn 


Asp 
215 


Gin 


Thr 


Leu 


Leu 


Asn 
230 


Leu 


His 


Ser 


His 


Phe 
245 


Tyr 


Gly 


Ala 


Glu 


Val 
260 


Asn 


Gin 


Gin 


Arg 


He 
275 


Val 


Ala 


Leu 


He 


Phe 
290 


Leu 


Glu 


Tyr 


Leu 


Gin 
305 


Glu 


Gly 


Thr 


Gly 


Arg 
320 


Gin 


Thr 


Glu 


Glu 


Gly 
335 


He 


He 


Ser 


Thr 


He 
350 


Ala 


Leu 


His 


Leu 


Phe 
365 


Glu 


Leu 


He 


Thr 


Val 
380 


He 


Asp 


Val 


Ala 


He 
395 


Ser 


Ala 


Phe 


Asp 


Gin 
410 


Asp 


He 


Ala 


Lys 


Asn 
425 


Val 


Gly 


Ser 


Leu 


Glu 
440 


Arg 


Val 


Ala 


Phe 


Val 
455 


His 


Lys 


Phe 


Leu 


Thr 
470 


Lys 


Gin 


Gly 


Pro 


Thr 
485 


Val 


Lys 


Gin 


Gly 


Asn 
500 


Asp 


He 


Gly 


Leu 


Val 
515 


Cys 


Thr 


Tyr 


Thr 


Arg 
530 


Gly 


Gly 


Asn 


Asn 


Arg 



Glu 


Lys 


His 


Thr 


Glu 
85 


Asn 


Phe 


Gin 


Gly 


Gly 
100 


Leu 


Ser 


Tyr 


Asp 


Thr 
115 


Lys 


Glu 


Trp 


Gin 


Met 
130 


Gly 


Gly 


Met 


Gin 


Lys 
145 


Leu 


Gly 


Lys 


Gly 


Leu 
160 


He 


Leu 


Thr 


Gly 


Gly 
175 


Asp 


Ala 


Leu 


Lys 


Glu 
190 


Thr 


He 


Gly 


He 


Ala 
205 


Leu 


Val 


Gly 


Arg 


Asp 
220 


Pro 


Leu 


Ser 


Lys 


Leu 
235 


He 


Leu 


Val 


Asp 


Asp 
250 


Arg 


Leu 


Arg 


Arg 


Glu 
265 


His 


Ala 


Arg 


He 


Gly 
280 


Glu 


Gly 


Gly 


Pro 


Asn 






Pro 




295 


Glu 


Ser 


Pro 


Val 
310 


Ala 


Ala 


Asp 


Leu 


Leu 
325 


Gly 
Arg
440 


Asn 


Glu 


Val 


Asp 


Arg 
445 


Thr Ala 


Ala 


Asp 


His 
450 


Gin 


Thr 


He 


Leu 


Arg 
455 


Ala 


Trp 


Ala 


Val 


Lys 
460 


Asp Phe 


Ala 


Pro 


Asn 
465 


Cys 


Pro 


Leu 


Tyr 


Val 


Gin 


He 


Leu 


Lys 


Pro 


Glu Asn 


Lys 


Phe 


His 








470 










475 








480 
Val


Lys 


Phe 


Ala 


Asp 
485 


His 


Val 


Val 


Cys 


Glu 
490 


Glu 


Glu 


Cys 


Lys 


Tyr 
495 


Ala 


Met 


Leu 


Ala 


Leu 
500 


Asn 


Cys 


He 


Cys 


Pro 
505 


Ala 


Thr 


Ser 


Thr 


Leu 
510 


Ile


Thr 


Leu 


Leu 


Val 
515 


His 


Thr 


Ser 


Arg 


Gly 
520 


Gin 


Glu 


Gly 


Gin 


Glu 
525 


Ser 


Pro 


Glu 


Gin 


Trp 
530 


Gin 


Arg 


Met 


Tyr 


Gly 
535 


Arg 


Cys 


Ser 


Gly 


Asn 
540 


Glu 


Val 


Tyr 


His 


He 
545 


Arg 


Met 


Gly 


Asp 


Ser 
550 


Lys 


Phe 


Phe 


Arg 


Glu 
555 


Tyr 


Glu 


Gly 


Lys 


Ser 
560 


Phe 


Thr 


Tyr 


Ala 


Ala 
565 


Phe 


His 


Ala 


His 


Lys 
570 


Lys 


Tyr 


Gly 


Val 


Cys 
575 


Leu 


He 


Gly 


Leu 


Lys 
580 


Arg 


Glu 


Asp 


Asn 


Lys 
585 


Ser 


Ile


Leu 


Leu 


Asn 
590 


Pro 


Gly 


Pro 


Arg 


His 
595 


He 


Leu 


Ala 


Ala 


Ser 
600 


Asp 


Thr 


Cys 


Phe 


Tyr 
605 


He 


Asn 


He 


Thr 


Lys 
610 


Glu 


Glu 


Asn 


Ser 


Ala 
615 


Phe 


He 


Phe 


Lys 


Gin 
620 


Glu 


Glu 


Lys 


Arg 


Lys 
625 


Lys 


Arg 


Ala 


Phe 


Ser 
630 


Gly 


Gin 


Gly 


Leu 


His 
635 


Glu 


Gly 


Pro 


Ala 


Arg 
640 


Leu 


Pro 


Val 


His 


Ser 
645 


Ile


He 


Ala 


Ser 


Met 
650 


Gly 


Thr 


Val 


Ala 


Met 
655 


Asp 


Leu 


Gin 


Gly 


Thr 
660 


Glu 


His 


Arg 


Pro 


Thr 
665 


Gin 


Ser 


Gly 


Gly 


Gly 
670 


Gly 


Gly 


Gly 


Ser 


Lys 
675 


Leu 


Ala 


Leu 


Pro 


Thr 
680 


Glu 


Asn 


Gly 


Ser 


Gly 
685 


Ser 


Arg 


Arg 


Pro 


Ser 
690 


Ile


Ala 


Pro 


Val 


Leu 
695 


Glu 


Leu 


Ala 


Asp 


Ser 
700 


Ser 


Ala 


Leu 


Leu 


Pro 
705 


Cys 


Asp 


Leu 


Leu 


Ser 
710 


Asp 


Gin 


Ser 


Glu 


Asp 
715 


Glu 


Val 


Thr 


Pro 


Ser 
720 


Asp 


Asp 


Glu 


Gly 


Leu 
725 


Ser 


Val 


Val 


Glu 


Tyr 
730 


Val 


Lys 


Gly 


Tyr 


Pro 
735 


Pro 


Asn 


Ser 


Pro 


Tyr 
740 


He 


Gly 


Ser 


Ser 


Pro 
745 


Thr 


Leu 


Cys 


His 


Leu 
750 


Leu 


Pro 


Val 


Lys 


Ala 
755 


Pro 


Phe 


Cys 


Cys 


Leu 
760 


Arg 


Leu 


Asp 


Lys 


Gly 
765 


Cys 


Lys 


His 


Asn 


Ser 
770 


Tyr 


Glu 


Asp 


Ala 


Lys 
775 


Ala 


Tyr 


Gly 


Phe 


Lys 
780 


Asn 


Lys 


Leu 


He 


He 
785 


Val 


Ser 


Ala 


Glu 


Thr 
790 


Ala 


Gly 


Asn 


Gly 


Leu 
795 


Tyr 


Asn 


Phe 


He 


Val 
800 


Pro 


Leu 


Arg 


Ala 


Tyr 
805 


Tyr 


Arg 


Ser 


Arg 


Lys 
810 


Glu 


Leu 


Asn 


Pro 


He 
815 


Val 


Leu 


Leu 


Leu 


Asp 
820 


Asn 


Lys 


Pro 


Asp 


His 
825 


His 


Phe 


Leu 


Glu 


Ala 
830 


He 


Cys 


Cys 


Phe 


Pro 
835 


Met 


Val 


Tyr 


Tyr 


Met 
840 


Glu 


Gly 


Ser 


Val 


Asp 
845 


Asn 


Leu 


Asp 


Ser 


Leu 
850 


Leu 


Gin 


Cys 


Gly 


He 
855 


Ile


Tyr 


Ala 


Asp 


Asn 
860 


Leu 


Val 


Val 


Val 


Asp 
865 


Lys 


Glu 


Ser 


Thr 


Met 
870 


Ser 


Ala 


Glu 


Glu 


Asp 
875 


Tyr 


Met 


Ala 


Asp 


Ala 
880 


Lys 


Thr 


He 


Val 


Asn 
885 


Val 


Gin 


Thr 


Met 


Phe 
890 


Arg 


Leu 


Phe 


Pro 


Ser 
895 


Leu 


Ser 


He 


Thr 


Thr 
900 


Glu 


Leu 


Thr 


His 


Pro 
905 


Ser 


Asn 


Met 


Arg 


Phe 
910 


Met 


Gin 


Phe 


Arg 


Ala 
915 


Lys 


Asp 


Ser 


Tyr 


Ser 
920 


Leu 


Ala 


Leu 


Ser 


Lys 
925 


Leu 


Glu 


Lys 


Arg 


Glu 
930 


Arg 


Glu 


Asn 


Gly 


Ser 
935 


Asn 


Leu 


Ala 


Phe 


Met 
940 


Phe 


Arg 


Leu 


Pro 


Phe 
945 


Ala 


Ala 


Gly 


Arg 


Val 


Phe 


Ser 


He 


Ser 


Met 


Leu 


Asp 


Thr 


Leu 


Leu 
950








955 




960 


Tyr 


Gin 


Ser 


Phe Val 


Lys 


Asp 


Tyr 


Met He 


Thr He Thr 


Arg Leu 








965 








970 




975 


Leu 


Leu 


Gly 


Leu Asp 


Thr 


Thr 


Pro 


Gly Ser 


Gly Tyr Leu 


Cys Ala 






980 








985 




990 


Met 


Lys 


He 


Thr Glu 


Gly 


Asp 


Leu 


Trp He 


Arg Thr Tyr 


Gly Arg 






995 








1000 




1005 


Leu 


Phe 


Gin 


Lys Leu 


Cys 


Ser 


Ser 


Ser Ala 


Glu He Pro 


He Gly 








1010 








1015 




1020 


He 


Tyr 


Arg 


Thr Glu 


Ser 


His 


Val 


Phe Ser 


Thr Ser Glu 


Pro His 




1025 








1030 




1035 


Glu 


Leu 


Arg 


Ala Gin 


Ser 


Gin 


He 


Ser Val 


Asn Val Glu 


Asp Cys 






1040 








1045 




1050 


Glu 


Asp 


Thr 


Arg Glu 


Val 


Lys 


Gly 


Pro Trp 


Gly Ser Arg 


Ala Gly 








1055 








1060 




1065 


Thr 


Gly 


Gly


Ser Ser 


Gin 


Gly 


Arg 


His Thr 


Gly Gly Gly 


Asp Pro 








1070 








1075 




1080 


Ala 


Glu 


His 


Pro Leu 


Leu 


Arg 


Arg 


Lys Ser 


Leu Gin Trp 


Ala Arg 








1085 








1090 




1095 


Arg


Leu 


Ser 


Arg Lys


Ala 


Pro 


Lys 


Gin Ala 


Gly Arg Ala 


Ala Ala 








1100 








1105 




1110 


Ala 


Glu 


Trp


He Ser 


Gin 


Gin 


Arg 


Leu Ser 


Leu Tyr Arg 


Arg Ser 








1115 








1120 




1125 


Glu 


Arg


Gin 


Glu Leu 


Ser 


Glu 


Leu 


Val Lys 


Asn Arg Met 


Lys His 






1130 








1135 




1140 


Leu 


Gly 


Leu 


Pro Thr 


Thr 


Gly 


Tyr 


Glu Asp 


Val Ala Asn 


Leu Thr 






1145 








1150 




1155 


Ala 


Ser 


Asp 


Val Met 


Asn 


Arg 


Val 


Asn Leu 


Gly Tyr Leu 


Gin Asp 








1160 








1165 




1170 


Glu 


Met 


Asn 


Asp His 


Gin 


Asn 


Thr 


Leu Ser 


Tyr Val Leu 


He Asn 








1175 








1180 




1185 


Pro 


Pro 


Pro 


Asp Thr 


Arg 


Leu 


Glu 


Pro Ser 


Asp He Val 


Tyr Leu 








1190 








1195 




1200 


He 


Arg 


Ser 


Asp Pro 


Leu 


Ala 


His 


Val Ala 


Ser Ser Ser 


Gin Ser 






1205 








1210 




1215 


Arg 


Lys 


Ser 


Ser Cys 


Ser 


His 


Lys 


Leu Ser 


Ser Cys Asn 


Pro Glu 








1220 








1225 




1230 


Thr 


Arg 


Asp 


Glu Thr 


Gin 


Leu 
Met


Ala 


Ser 


Ala 


Leu 


Ser 


Tyr 


Val 


Ser 


Lys 


Phe Lys Ser 


Phe 


Val 


1 








5 










10 






15 


He 


Leu 


Phe 


Val 


Thr 


Pro 


Leu 


Leu 


Leu 


Leu 


Pro Leu Val 


He 


Leu 










20 










25 






30 


Met 


Pro 


Ala 


Lys 


Phe 


Val 


Arg 


Cys 


Ala 


Tyr 


Val He He 


Leu 


Met 










35 










40 






45 


Ala 


He 


Tyr 


Trp 


Cys 


Thr 


Glu 


Val 


He 


Pro 


Leu Ala Val 


Thr 


Ser 






50 










55 






60 


Leu 


Met 


Pro 


Val 


Leu 


Leu 


Phe 


Pro 


Leu 


Phe 


Gin He Leu 


Asp 


Ser 










65 










70 






75 


Arg 


Gin 


Val 


Cys 


Val 


Gin 


Tyr 


Met 


Lys 


Asp 


Thr Asn Met 


Leu 


Phe 






80 










85 






90 


Leu 


Gly 


Gly 


Leu 


He 


Val 


Ala 


Val 


Ala 


Val 


Glu Arg Trp 


Asn 


Leu 
His


Lys 


Arg 


Pro 


Ala 


Arg 


Ser 


Met 


Trp


Ile


Val 


Glu 


Thr 


Glu 


Ala 


Leu 


Pro 


Ala 


Phe 


Glu 


Gly 


Arg 


Leu 


Cys 


Ile


Gly 


Gly 


Leu 


Leu 


Gly 


Val 


Asn 


Phe 


Val 


Met 


Leu 


Arg 


Phe 


Asn 


Lys 


Asn 


Glu 


Lys 


Leu 


Gly 


Phe 


Phe 


Leu 


Met 


Pro 


Gly 


Pro 


Phe 


Tyr 


Lys 


Val 


Pro 


Leu 


Ala 


Lys 


Lys 


Gin 


Met 


Leu 


Ile


Leu 


Asn 


Val 


Ala 


Ser 


Arg 


Ser 


Thr 


Leu 


Ser 


Pro 


Asn 


Ala 


Met 


Val 


Lys 


Val 


Phe 


Leu 


Asp 


His 


Phe 
95




Ile


Ala 


Leu 




110 




Leu 


Met 


Leu 




125 




He 


Ser 


Asn 




140 




Ala 


He 


Leu 




155 




Gly


Leu 


Glu 




170 




Asn 


Ser 


Ala 




185 




Pro 


Thr 


Leu 




200 




Lys 


Ala 


Met 




215 




Thr 


Ala 


Thr 




230 




Gin 


Met 


Asn 




245 




Ala 


Ser 






260 




Leu 


Phe 


Ala 




275 




Phe 


Lys 


Lys 




290 




Lys 


Ala 


Ala 




305 




Pro 


Leu 


Ser 




320 




Leu 


Val 


He 




335 






Leu 


Thr 




350 




Pro 


Pro 


Pro 




365 






Gly 


He 




380 




Gly 


Ser 


Glu 




395 




Glu 


Pro 


Leu 








Ser 


Leu 


Leu 




425 




Thr 


Thr 


Thr 




440 




He 


Gly


Leu 




455 




Ala 


Ser 


Phe 




470 




Ile


Val 


Phe 




485 




Thr 


Gly 


Val 




500 




Ala 


Val 


Asn 




515 




Pro 


Asp 


Trp 




530 





Arg 


Thr 


Leu 


Gly 


Phe 


Met 


Thr 


Ala 


Thr 


Gin 


Gin 


Met 


Leu 


Val 


Asp 


Val 


Pro 


Thr 


Gly 


Gin 


Gin 


Thr 


Leu 


Cys 


Leu 


Thr 


Gly 


Glu 


Leu 


Phe 


Phe 


Ala 


Phe 


Trp 


Leu 


Trp 


Ser 


Trp 


Gly 


Leu 


Lys 


Val 


Phe 


Ala 


Glu 


Leu 


Trp 


Phe 


Val 


Ala 


Trp 


Leu 


Leu 


Asp 


Val 


Leu 


Leu 


Ala 


Ser 


Gly 


His 


Ala 


Val 


Val 


Ala 


Val 


Leu 


Phe 


Leu 


Asn 


Pro 


Leu 


Ala 


Phe 


Met 


Thr 


Tyr 


Gly 


He 


Met 


Asn 


Thr 


Trp 


Gly 


Ala 


Asn 


Val 



100 






Leu 


Trp 


Val 


115 






Gly 


Val 


Thr 


130 






Thr 


Ala 


Met 


145 






Glu 


Ala 


Thr 


160 






Lys


Gly 


Lys 


175 






Thr 


Gly 


Ser 


190 






Glu 


Asp 


Gin 


205 






He 


Cys 


Tyr


220 






Thr 


Gly


Pro 


235 






Pro 


Asp 


Ser 


250 






Ala 


Phe 


Pro 


265 






Leu 


Gin 


Phe 


280 






Cys 


Gly


Leu 


295 






Leu 


Gin 


Glu 


310 






He 


Asn 


Val 


325 






Ser 


Arg 


Asp 


340 






Val 


Glu 


Glu 


355 








Lys 


Val 


370 






Leu 


Gly


Gly


385 






Leu 


Ser 


Val 


400 






Pro 


Pro 


Ala 


415 






Phe 


Thr 


Glu 


430 






Pro 


He 


Phe 


445 






Tyr


He 


Met 


460 






Leu 


Pro 


Val 


475 






His 


Leu 


Lys 


490 






He 


He 


Gly 


505 






Arg 


Ala 


He 


520 






Thr 


His 


He 


535 











105 


Gly 


Ala 


Lys 
120 


Ala 


Leu 


Leu 
135 


Met 


Val 


Pro 
150 


Ser 


Ala 


Ala 
165 


Ala 


Lys 


Glu 
180 


Gin 


Val 


He 
195 


Glu 


Arg 


Lys 
210 


Ala 


Ala 


Ser 
225 


Asn 


Val 


Val 
240 


Lys 


Asp 


Leu 
255 


Asn 


Met 


Leu 
270 


Val 


Tyr 


Met 
285 


Glu 


Ser 


Lys 
300 


Glu 


Tyr 


Arg 
315 


Leu 


He 


Cys 
330 


Pro 


Gly 


Phe 
345 


Arg 


Lys 


Thr 
360 


Thr 


Gin 


Glu 
375 


Gly 


Phe 


Ala 
390 


Trp 


Met 


Gly 
405 


Ala 


He 


Thr 
420 


Cys 


Thr 


Ser 
435 


Ala 


Ser 


Met 
450 


Leu 


Pro 


Cys 
465 


Ala 


Thr 


Pro 
480 


Val 


Ala 


Asp 
495 


Val 


Phe 


Cys 
510 


Phe 


Asp 


Leu 
525 


Glu 


Thr 
<400> 26



Met 


Lys 


Ala 


His 


Pro 


Lys


Glu Met Val Pro Leu Met Gly Lys Arg 


1 








5 






10 15 


Val 


Ala 


Ala 


Pro 


Ser 


Gly 


Asn Pro Ala Val Leu Pro Glu Lys Arg 










20 






25 30 


Pro 


Ala 


Glu 


He 


Thr 


Pro 


Thr Lys Lys Ser Ile Ser Gly Asn Cys










35 






40 45 


Asp 


Asp 


Met 


Asp 


Ser 


Pro 


Gln Ser Pro Gln Asp Asp Val Thr Glu










50 






55 60 


Thr 


Pro 


Ser 


Asn 


Pro 


Asn 


Ser Pro Ser Ala Gln Leu Ala Lys Glu










65 






70 75


Glu 


Gin 


Arg 


Arg 


Lys 
80 


Lys 


Arg 


Arg Leu Lys Lys Arg Ile Phe Ala

85 90


Ala 


Val 


Ser 


Glu 


Gly
95 


Cys 


Val 


Glu Glu Leu Val Glu Leu Leu Val 
100 105


Glu 


Leu 


Gin 


Glu 


Leu 
110 


Cys 


Arg 


Arg Arg His Asp Glu Asp Val Pro 

115 120


Asp 


Phe 


Leu 


Met 


His 
125 


Lys 


Leu 


Thr Ala Ser Asp Thr Gly Lys Thr 

130 135


Cys 


Leu 


Met 


Lys 


Ala 
140 


Leu 


Leu 


Asn Ile Asn Pro Asn Thr Lys Glu


He 


Val 




He 


Leu 
155 


Leu 


Ala 


Phe Ala Glu Glu Asn Asp Ile Leu
160 165


Gly


Arg


Phe 


He 


Asn 
170 


Ala 


Glu 


Tyr Thr Glu Glu Ala Tyr Glu Gly 


Gin 


Thr 


Ala 


Leu 


Asn 
185 


He 


Ala 


Ile Glu Arg Arg Gln Gly Asp Ile

190 195


Ala 


Ala 


Leu 


Leu 


He 
200 


Ala 


Ala 


Gly Ala Asp Val Asn Ala His Ala 


Lys 


Gly


Ala 


Phe 


Phe 
215 


Asn 


Pro 


Lys Tyr Gln His Glu Gly Phe Tyr


Phe 


Gly


Glu 


Thr 


Pro 
230 


Leu 


Ala 


Leu Ala Ala Cys Thr Asn Gln Pro

235 240 


Glu 


He 


Val 


Gin 


Leu 
245 


Leu 


Met 


Glu His Glu Gln Thr Asp Ile Thr
250 255 


Ser 


Arg


Asp


Ser 


Arg 
260 


Gly


Asn 


Asn Ile Leu His Ala Leu Val Thr
265 270 


Val 


Ala 


Glu 


Asp 


Phe 
275 


Lys 


Thr 


Gln Asn Asp Val Val Lys Arg Met
280 285 


Tyr


Asp 


Met 


He 


Leu 
290 


Leu 


Arg 


Ser Gly Asn Trp Glu Leu Glu Thr 
295 300 


Thr 


Arg 


Asn 


Asn 


Asp 
305 


Gly 


Leu 


Thr Pro Leu Gln Leu Ala Ala Lys
310 315 


Met 


Gly 


Lys 


Ala 


Glu 
320 


He 


Leu 


Lys Tyr Ile Leu Ser Arg Glu Ile
325 330 


Lys 


Glu 


Lys 


Arg 


Leu 
335 


Arg 


Ser 


Leu Ser Arg Lys Phe Thr Asp Trp 
340 345 


Ala 


Tyr 


Gly 


Pro 


Val 
350 


Ser 


Ser 


Ser Leu Tyr Asp Leu Thr Asn Val
355 360 


Asp 


Thr 


Thr 


Thr 


Asp 
365 


Asn 


Ser 


Val Leu Glu Ile Thr Val Tyr Asn
370 375 


Thr 


Asn 


Ile


Asp 


Asn 
380 


Arg 


His 


Glu Met Leu Thr Leu Glu Pro Leu 
385 390 


His 


Thr 


Leu 


Leu 


His 
395 


Met 


Lys 


Trp Lys Lys Phe Ala Lys His Met 
400 405 


Phe 


Phe 


Leu 


Ser 


Phe 
410 


Cys 


Phe 


Tyr Phe Phe Tyr Asn Ile Thr Leu
415 420 
Thr


Leu 


Val Ser 


Tyr 
425 


Tyr 


Arg 


Pro 


Arg 


His 


Pro 


Leu Ala 


Leu 
440 


Thr 


His 


Lys 


Met 


Gly 


Arg 


Met Phe 


Val 
455 


Leu 


He 


Trp 


Ala 


Glu 


Gly 


He Ala 


He 
470 


Phe 


Leu 


Leu 


Arg 


He 


Leu 


Ser Asp 


Ala 
485 


Trp 


Phe 


His 


Phe 


Val 


Leu 


Val He 


Leu 
500 


Ser 


Val 


Phe 


Leu 


Glu 


Tyr 


Leu Ala 


Cys 
515 


Leu 


Val 


Leu 


Ala 


Asn 


Met 


Leu Tyr 


Tyr 
530 


Thr 


Arg 


Gly 


Phe 


Ser 


Val 


Met He 


Gin 
545 


Lys 


Val 


He 


Leu 


Leu 


Phe 


Val Tyr 


He 
560 


Val 


Phe 


Leu 


Leu 


Ala 


Ser 


Leu Ile


Glu 
575 


Lys 


Cys 


Pro 


Lys 


Ser 


Tyr 


Gly Ser 


Phe 
590 


Ser 


Asp 


Ala 


Val 


Thr 


Ile


Gly Leu 


Gly 
605 


Asp 


Leu 


Asn 


Ile


Pro


Ile


Leu Phe 


Leu 
620 


Phe 


Leu 


Leu 


Ile


Phe 


Val 


Leu Leu 


Leu 
635 


Asn 


Met 


Leu 


Ile


Val 


Glu 


Asn Val 


Ser 
650 


Lys 


Glu 


Ser 


Glu 


Arg 


Ala 


Arg Thr 


Ile
665 


Leu 


Glu 


Phe 


Glu 


Leu 


Arg 


Ser Arg 


Phe 
680 


Arg 


Met 


Gly 


Glu 


Asp 


Asp 


Phe Arg 


Leu 
695 


Cys 


Leu 


Arg 


Ile


Glu 


Trp 


Lys Thr 


His 
710 


Val 


Ser 


Phe 


Leu 


Val 


Arg 


Arg Thr 


Asp 
725 


Phe 


Asn 


Lys 


Ile


Asn 


Ser 


Lys Thr 


Thr 
740 


Leu 


Asn 


Ala 


Phe 


Pro 


Glu 


Thr Ser 


Val 
755 
Met Lys Ser His Pro Ala Ile Gln Ala

1 5 
Gly Ala Ala Gly Gly Gly Ala Cys Val 
20 

Asp Thr Ile Lys Val Lys Met Gln Thr
Glu


Glu 


Glu Ala Ile


Pro 


430 






435 


Gly 


Trp 


Leu Gln Leu


Leu 


445 






450 


Met 


Cys 


Ile Ser Val


Lys 


460 






465 


Pro 


Ser 


Asp Leu Gln


Ser 


475 






480 


Val 


Phe 


Phe Ile Gln


Ala 


490 






495 


Tyr 


Leu 


Phe Ala Tyr 


Lys 


505 






510 


Met 


Ala 


Leu Gly Trp 


Ala 


520 






525 


Gln


Ser 


Met Gly Met 


Tyr 


535 






540 
1500


Pro 


Gln


Ala 


Ala Gln


Gln


Glu 


Arg 


Phe Ser 


Ser Leu 


Met 


Val Tyr 








1505 








1510 






1515 


Lys 


Leu 


Pro 


Val Glu 


Asp 


Val 


Arg 


Pro Leu 


Ser Gln


Ala 


Phe Phe 








1520 








1525 






1530 


Lys 


Leu 


Glu 


Ile Val


Lys 


Gln


Ser 


Phe Asp 


Leu Glu 


Glu 


Tyr Ser 








1535 








1540 






1545 


Leu 


Ser 


Gln


Ser Thr 


Leu 


Glu 


Gln


Val Phe 


Leu Glu 


Leu 


Ser Lys 








1550 








1555 






1560 


Glu 


Gln


Glu 


Leu Gly 


Asp 


Leu 


Glu 


Glu Asp 


Phe Asp 


Pro 


Ser Val 








1565 








1570 






1575 


Lys 


Trp 


Lys 


Leu Leu 


Leu 


Gln


Glu 


Glu Pro 









1580 1585 

<210> 31 
<211> 1129 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2194064CB1 

<400> 31 

gcggccgcag ccttcgcgat aaacgggctg tcctacgggc tgctgcgctc gctgggcctt 60 
gccttccctg accttgccga gcactttgac cgaagcgccc aggacactgc gtggatcagc 120 
gccctggccc tggccgtgca gcaggcagcc agccccgtgg gcagcgccct gagcacgcgc 180
tggggggccc gccccgtggg tgatggttgg gggcgtcctc gcctcgctgg gcttcgtctt 240 
ctcggctttc gccagcgatc tgctgcatct ctacctcggc ctgggcctcc tcgctggctt 300 
tggttgggcc ctggtgttcg cccccgccct aggcaccctc tcgcgttact tctcccgccg 360 
tcgagtcttg gcggtggggc tggcgctcac cggcaacggg gcctcctcgc tgctcctggc 420 
gcccgccttg cagcttcttc tcgatacttt cggctggcgg ggcgctctgc tcctcctcgg 480 
cgcgatcacc ctccacctca ccccctgtgg cgccctgctg ctacccctgg tccttcctgg 540 
agacccccca gccccaccgc gtagtcccct agctgccctc ggcctgagtc tgttcacacg 600 
ccgggccttc tcaatctttg ctctaggcac agccctggtt gggggcgggt acttcgttcc 660 
ttacgtgcac ttggctcccc acgctttaga ccggggcctg gggggatacg gagcagcgct 720 
ggtggtggcc gtggctgcga tgggggatgc gggcgcccgg ctggtctgcg ggtggctggc 780 
agaccaaggc tgggtgcccc tcccgcggct gctggccgta ttcggggctc tgactgggct 840 
ggggctgtgg gtggtggggc tggtgcccgt ggtgggcggc gaagagagct gggggggtcc 900 
cctgctggcc gcggctgtgg cctatgggct gagcgcgggg agttacgccc cgctggtttt 960 
cggtgtactc cccgggctgg tgggcgtcgg aggtgtggtg caggccacag ggctggtgat 1020 
gatgctgatg agcctcgggg ggctcctggg ccctcccctg tcaggtaagg acctgagctc 1080 
acagatctgc ctacaactat cctctgcccc tggggttcga ggcttctaa 1129 

<210> 32 

<211> 2699 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2744094CB1 

<400> 32 

agtgtgctgg aaagtgttat tattatttat aaaactgaac ctgcaggacc tgcataaggt 60 
gaactaactt ctctagaaca ctgccctgtg ccaggcactg ggctgagtgc ttcacacaca 120 
ttatatcata tcatcctcag catggctctg aagctggtaa ggttatcact atttcatagg 180 
taaagaagtg gtatgttgct gacatttggt accttatcca aggtcacacg gtttggaagt 240 
ggtggagcca ggatttgaac ccagggctct ctgactttgg aacccagctc ttcccattcc 300 
tccaaggtgt ccacattggg tggggctcaa gggtttccaa gcactcaaag acagagatgg 360 
ccgagcagct gagtcagcag ttgcctcgca cctgtttgtg gcacttgtac atcaccactg 420 
tctccctccc aggttacatg gtctcttgca taattttctt ctttgtggtg ccgatcgtct 480 
tcttaacgat cttcagcttc tggtggctga gctactggtt ggagcagggc tcggggacca 540 
atagcagccg agagagcaat ggaaccatgg cagacctggg caacattgca gacaatcctc 600 
aactgtcctt ctaccagctg gtgtacgggc tcaacgccct gctcctcatc tgtgtggggg 660 
tctgctcctc agggattttc accaaggtca cgaggaaggc atccacggcc ctgcacaaca 720 
agctcttcaa caaggttttc cgctgcccca tgagtttctt tgacaccatc ccaataggcc 780 
ggcttttgaa ctgcttcgca ggggacttgg aacagctgga ccagctcttg cccatctttt 840 
cagagcagtt cctggtcctg tccttaatgg tgatcgccgt cctgttgatt gtcagtgtgc 900 
tgtctccata tatcctgtta atgggagcca taatcatggt tatttgcttc atttattata 960 
tgatgttcaa gaaggccatc ggtgtgttca agagactgga gaactatagc cggtctcctt 1020 
tattctccca catcctcaat tctctgcaag gcctgagctc catccatgtc tatggaaaaa 1080 
ctgaagactt catcagccag tttaagaggc tgactgatgc gcagaataac tacctgctgt 1140 
tgtttctatc ttccacacga tggatggcat tgaggctgga gatcatgacc aaccttgtga 1200 
ccttggctgt tgccctgttc gtggcttttg gcatttcctc caccccctac tcctttaaag 1260 
tcatggctgt caacatcgtg ctgcagctgg cgtccagctt ccaggccact gcccggattg 1320 
gcttggagac agaggcacag ttcacggctg tagagaggat actgcagtac atgaagatgt 1380 
gtgtctcgga agctccttta cacatggaag gcacaagttg tccccagggg tggccacagc 1440 
atggggaaat catatttcag gattatcaca tgaaatacag agacaacaca cccaccgtgc 1500 
ttcacggcat caacctgacc atccgcggcc acgaagtggt gggcatcgtg ggaaggacgg 1560 
gctctgggaa gtcctccttg ggcatggctc tcttccgcct ggtggagccc atggcaggcc 1620 
ggattctcat tgacggcgtg gacatttgca gcatcggcct ggaggacttg cggtccaagc 1680 
tctcagtgat ccctcaagat ccagtgctgc tctcaggaac catcagattc aacctagatc 1740 
cctttgaccg tcacactgac cagcagatct gggatgcctt ggagaggaca ttcctgacca 1800 
aggccatctc aaagttcccc aaaaagctgc atacagatgt ggtggaaaac ggtggatact 1860 
tctctgtggg ggagaggcag ctgctctgca ttgccagggc tgtgcttcgc aactccaaga 1920 
tcatccttat cgatgaagcc acagcctcca ttgacatgga gacagacacc ctgatccagc 1980 
gcacaatccg tgaagccttc cagggctgca ccgtgctcgt cattgcccac cgtgtcacca 2040 
ctgtgctgaa ctgtgaccgc atcctggtta tgggcaatgg gaaggtggta gaatttgatc 2100
ggccggaggt actgcggaag aagcctgggt cattgttcgc agccctcatg gccacagcca 2160
cttcttcact gagataagga gatgtggaga cttcatggag gctggcagct gagctcagag 2220
gttcacacag gtgcagcttc gaggcccaca gtctgcgacc ttcttgtttg gagatgagaa 2280
cttctcctgg aagcaggggt aaatgtaggg ggggtgggga ttgctggatg gaaaccctgg 2340
aataggctac ttgatggctc tcaagacctt agaaccccag aaccatctaa gacatgggat 2400
tcagtgatca tgtggttctc cttttaactt acatgctgaa taattttata ataaggtaaa 2460
agcttatagt tttctgatct gtgttagaag tgttgcaaat gctgtactga ctttgtaaaa 2520
tataaaacta aggaaaactc actttctttg ttctgcttcc tttcgtttct tttctttttg 2580
tttttttaga cagggtcttg ctctgttgcc caggctggag cgcagtggcg caatctcagc 2640
tcactgtagc ctctgcctcc caggttcaag caattctcct gcctcagcct cctgaatag 2699



<210> 33 

<211> 6369 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2798241CB1 



<400> 33 

tgccaccaca cccagctaat ttttatatgt 
caggctggtc tcaaactcct gacttcgtga 
gattacaggc gtgagccacc gcacccggtc 
taatattgga ttgtatgaga ctttgggttt 
tttctcttag caggctgtca accaggttag 
ttgaactgta gttgtcattt tagtttattt 
tgtcttgtgt gtgccttgca gtgaacagtc 
cagtttctca agcctttgtc acactaatag 
ttattacagg aattaaaaac aactttttag 
tcccccttct actttttgct tccctgtggc 
gtgtttattt tctccattgt gttacacact 
caatggtagg aggtagagaa gaaaagcaaa 
gttccattga atagagagaa aggttttcct 
gttactgtag tctggccctg ttaccatggg 
cagaaaagaa aaaaagattt gctatttcta 
cccattcctc aattcaaagc taaggcttct 
gataccaaag gagaaaaagt accactgttg 
ctatttcaca tgccttgttt acttttcaga 
ccagtttcct aagagagaca gcttggagta 
agctgcttat tttgccgtta aaaattacat 
gtttgttttt cctctttaat aggtttattc 
aggagtttgg agacagacca gaacacttct 
caaaaagagt agtgttcagg aaattctttt 
aattagcatg atgcatccaa ataagaaata 
tatggacaag tttactcttt ctaatctaat 
aagcagcatc atgcagaaag tgtctactga 
atatacaaat gaaaaagaaa tgttaacatc 
tgtggttttc aaagactcca tgtcctatga 
atcttctatt tatatggatt caagagctgg 
ctggtcctca ggtttcacag ttttacaagc 
gaccaatgtt tctctttgga aggagctgga 
tgctgttgta gaaatagata cctttccccg 
attttcacct tttggatact ttttggcaat 
aaaagaattt ttaaagataa tgggacttca 
tctatataca agtttaattt ttcttatgtc 
tttgttattt cctcaaagta gcagcattgt 
atcatctgta ttttttgctt taatgctgac 
aatagttgaa ttttttgtta ctgtggcttt 
agaaagtttt cccaaatcgt tagtgtggct 
gattggtatt gcacaggtca tgcatttaga 
tttgactgca ggcccatatc ctctaattat 
attctatgtc ctcttggctg tctatcttga 



ttagtagaga cagggtttca ccatgttggt 60 
tctgcccacc ttggcctccc aaagtgctgg 120 
agctattttc tacatgcttc atttgcagtg 180 
tgtgttaata cctacagaaa atgttgatat 240 
gttcaggtca taagtttcta cccacattct 300 
ttcaaaaact tttgcagtac ctttttggtc 360 
tggatttgga cagtggtctg tctgttagtt 420 
gattggattt atgtatgtcc agcttgggaa 480 
agtgctttcc tgagctctct ttctatttgt 540 
tgctgtttct atcctccagc cagagagcta 600 
tgtgcagctg caaccaccat atccagggcc 660 
agggattggc ctcatcctct tacaacgata 720 
gcctcagagt gttggctgca ctaggctttt 780 
attgcttgca tgtggggata caggagaatt 840 
cattctccct gagcattaag acttcccttg 900 
tctggagctg cctctgtggg cggttcggga 960 
atatggtggt atttcaaatt ctggtctacc 1020 
gctgacagat tgctgctcca tgcattctgt 1080 
tgcttaatcc atcttacctg ggactgaaac 1140 
gcagtttact gcgtggctcc gggtttgttt 1200 
agaaaacatg tccactgcaa ttagggaggt 1260 
actgaagaat tacttaatta aatgcagaac 1320 
tccactattt tttttatttt ggttaatatt 1380 
tgaagaagtg cctaatatag aactcaatcc 1440 
tcttggatat actccagtga ctaatattac 1500 
tcatctacct gatgtcataa ttactgaaga 1560 
cagtctctct aagccgagca actttgtagg 1620 
acttcgtttt tttcctgata tgattccagt 1680 
ctgttcaaaa tcatgtgagg ctgctcagta 1740 
atccatagat gctgccatta tacagttgaa 1800 
gtcaactaaa gctgttatta tgggagaaac 1860 
aggagtaatt ttaatatacc tagttatagc 1920 
tcatatcgta gcagaaaaag aaaaaaaaat 1980 
tgatactgcc ttttggcttt cctgggttct 2040 
ccttcttatg gcagtcattg cgacagcttc 2100 
gatatttctg ctttttttcc tttatggatt 2160 
acctcttttt aaaaaatcaa aacatgtggg 2220 
tggatttatt ggccttatga taatcctcat 2280 
tttcagtcct ttctgtcact gtacttttgt 2340 
agattttaat gaaggtgctt cattttcaaa 2400 
tacaattatc atgctcacac ttaatagtat 2460 
tcaagtcatt ccaggggaat ttggcttacg 2520 
gagatcatct ttatattttc tgaagccttc
ggagttatca gagggcaatg ttaatggaaa 
ttcttcagaa tttgtaggaa aagaagccat 
aaagaagggt gaaaatgtgg aggctttgag 
gattactgcc ttacttggcc acagtggaac 
tggactctgc ccaccttctg atgggtttgc 
agatgaaatg tttgaagcaa gaaaaatgat 
tgatgttttg acagtagaag aaaatttatc 
caacaatata atacaagaag tgcagaaggt 
agataaccaa gctaaaaaat taagtggtgg 
tgttcttggg aacccaaaga tactgctgct 
ttctcgacat attgtatgga atcttttaaa 
cagtactcat ttcatggatg aagctgacat 
aggaatgctg aaatgtgttg gttcttcaat 
ccgcctgagc atgtacatag acaaatattg 
acaacatata cctggagcta ctttattaca 
gcctttcaag gacatggaca aattttcagg 
tttgggtgtc atttcttatg gtgtttccat 
agaagttgaa gcagaaattg accaagcaga 
ggaagaaatg gattcaaaat cttttgatga 
aaccaaggct tctctagtga gcaccatgag 
aaagtttcat ttctttacct tgaaacgtga 
gcttttaatt tttttcacag ttcagatttt 
tgctgtggtt cccatcaaac ttgttccaga 
acataaatac aaaacaagtc tgcttcttca 
tattagcttt ttcacaagcc agaacataat 
atccgtggct ccccatagtg cggctttaaa 
tgcagctgtt ttcaacagta ctatggttta 
taactactat ctttatcatt taaatgtgac 
ctttcaagaa attactgata tagtttttaa 
tggaatcatt gttactgcaa tgccacctta 
gatcaaagct tatactcaac ttaaactttc 
acaagctgtt gttgatatcc ccttattttt 
attggcattt cattatggat tatattttta 
ccttattggt tatgttccat cagttattct 
gaaaatttta aataccaaag aattttggtc 
tattgcaatc actgaaataa ctttctttat 
tgccttttgt atcatcattc caatctatcc 
gatttcttgg aagaatgtac gaaaaaatgt 
agtagctgtt atatcgcctt acctgcagtg 
tgagaaaaaa tatggaggca gatcaataag 
gaagtctaaa aataggaagc ttccagaacc 
caaagctgaa agactaaagg tcaaagagct 
atccattatg gtcagcaatt tgcataaaga 
aagaaaagta aagaaagtgg caactaaata 
cttaggacta ttgggtccaa atggtgctgg 
tgatattgaa ccaacttcag gccaggtatt 
agatgatgat tcactgaagt gtatgggtta 
tactacattg caggaacatt ttgaaattta 
catgaaagaa gtcataagtc gaataacaca 
gactgtaaag aaactacctg caggaatcaa 
agggaatcct cagattactt tgctagatga 
acagcacatg tggcgagcaa ttcgaactgc 
gaccactcac tatatggagg aggcagaggc 
tgggcagtta agatgtatcg gaacagtaca 
ctttttggaa attaaattga aggactggat 
agaaattcag tatattttcc caaatgcaag 
ttataaaatt cctaaggaag atgttcagtc 
agctaaacat gcttttgcca ttgaagaata 
ttttgtagaa ctcactaaag aacaagagga 
cacactttgg tgggaacgaa cacaagaaga 
gtctgcttac tgggacttct ttctttttca 
ttattggaat ggtaactgga gaaccaagaa 



atattggtca aagagcaaaa gaaattatga 2580 
tattagtttt agtgaaatta ttgagccagt 2640 
aagaattagt ggtattcaga agacatacag 2700 
aaatttgtca tttgacatat atgagggtca 2760 
aggaaagagt acattgatga atattctttg 2820 
atctatatat ggacacagag tctcagaaat 2880 
tggcatttgt ccacagttag atatacactt 2940
aattttggct tcaatcaaag ggataccagc 3000 
tttactagat ttagacatgc agactatcaa 3060 
tcaaaaaaga aagctgtcat taggaattgc 3120 
agatgaacca acagctggaa tggacccctg 3180 
atacagaaaa gccaatcggg tgacagtgtt 3240 
tcttgcagat aggaaagctg tgatatcaca 3300 
gttcctcaaa agtaaatggg ggatcggcta 3360 
tgccacagaa tctctttctt cactggttaa 3420 
acagaatgac caacaacttg tgtatagctt 3480 
tttgttttct gccctagaca gtcattcaaa 3540 
gacgactttg gaagacgtat ttttaaagct 3600 
ttatagtgta tttactcagc agccactgga 3660 
aatggaacag agcttactta ttctttctga 3720 
cctttggaaa caacagatgt atacaatagc 3780 
aagtaaatca gtgagatcag tgttgcttct 3840 
tatgtttttg gttcatcact cttttaaaaa 3900 
cttatatttt ctaaaacctg gagacaaacc 3960 
aaattctgct gactcagata tcagtgatct 4020 
ggtgacgatg attaatgaca gtgactatgt 4080 
tgtggtgcat tcagaaaagg actatgtttt 4140 
ttctttacct atattagtga atatcattag 4200 
tgaaaccatc cagatctgga gtaccccatt 4260 
aattgagctg tattttcaag cagctttgct 4320 
ctttgccatg gaaaatgcag agaatcataa 4380 
aggtcttttg ccatctgcat attggattgg 4440 
tatcattctt attttgatgc taggaagctt 4500 
tactgtaaag ttccttgctg tggttttttg 4560 
gttcacttat attgcttctt tcacctttaa 4620 
atttatctat tctgtggcag cgttggcttg 4680 
gggatacaca attgcaacta ttcttcatta 4740 
acttctaggt tgcctgattt ctttcataaa 4800 
ggacacctat aatccatggg ataggctttc 4860 
tgtactgtgg attttcctct tacaatacta 4920 
aaaagatccc tttttcagaa acctttcaac 4980 
accagacaat gaggatgaag atgaagatgt 5040 
gatgggttgc cagtgttgtg aggagaaacc 5100 
atatgatgac aagaaagatt ttcttctttc 5160 
catctctttc tgtgtgaaaa aaggagagat 5220 
caaaagcaca attattaata ttctggttgg 5280 
tttaggagat tattcttcag agacaagtga 5340 
ctgtcctcag ataaaccctt tgtggccaga 5400 
tggagctgtc aaaggaatga gtgcaagtga 5460 
tgcacttgat ttaaaagaac atcttcagaa 5520 
acgaaagttg tgttttgctc taagtatgct 5580 
accatctaca ggtatggatc ccaaagccaa 5640 
atttaaaaac agaaagcggg ctgctattct 5700 
tgtctgtgat cgagtagcta tcatggtgtc 5760 
acatctaaag agtaaatttg gaaaaggcta 5820 
agaaaaccta gaagtagacc gccttcaaag 5880 
ccgtcaggaa agtttttctt ctattttggc 5940 
cctttcacaa tcttttttta agctggaaga 6000 
tagcttttct caagcaacat tggaacaggt 6060 
ggaagataat agttgtggaa ctttaaacag 6120 
tagagtagta ttttgaattt gtattgttcg 6180 
cttaatttta actttggttt aaaaagtttt 6240 
cgcacttgaa atttttctaa gctccttaat 6300 
tgaaatgctg tggttgtgtg ttttgctttt ctttaaataa aacgtatgta taattaaaaa 6360
aaaaaaaaa 6369 



<210> 34 

<211> 2558 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 3105257CB1



<220> 

<221> unsure 
<222> 2523 
<223> a, t, c, g, or other



<400> 34 

atggggcgcg 

ctgccggggc 

gctgtgggct 

ccttcccggg 

ctcctggggc 

atgtccggtg 

tataccaacc 

gccgtgtttc 

ggtcagcgca 

gttgctttct 

gcactcctgg 

gcttccgtag 

agcgtggtgc 

accaaagtca 

aatgtaagaa 

aaagtggacc 

ggagcaactg 

ctgatgggca 

tgggttggaa 

ggtgcagggg 

ggggtcatct 

gcctatccag 

ggatctgtca 

ctgctgaggt 

cagctaaacc 

ttttcttgct 

gctgaggaaa 

ccccaagggt 

cagcggattg 

gcaaccagtg 

atggatggaa 

atggttgctg 

tcaaaaccaa 

ggaagcaatt 

actcagagac 

ttccaaagtg 

agttttaata 

attttctttt 

tgctgttaaa 

aagcctgatt 

cctactttta 

tcggtaatct 

ccncgggttt 



gggccggcgc 
gggggtctgg 
tgggcccggc 
acgaccagcg 
tggagtaccc 
ttatctccat 
ccactgtgga 
tgtgtggtgc 
ttgtgaatag 
ttgacaagac 
ggcgctcagt 
gcatcagtat 
ctccagtgtc 
ctcaggattc 
ctgttcgagc 
atgtaatgca 
ggctctccgg 
gtgcccacat 
taagcattgg 
ggcgcctctg 
taaatgagaa 
ctcgcccaga 
cggcactggt 
tgtacgaccc 
cagtgtggct 
ctattgctga 
tccagagagt 
tcaacactgt 
cgattgcccg 
cgctggatgc 
gaacggtgtt 
ttcttgacca 
atgggatata 
actggtaaac 
tatgaaatac 
tgtaaaagat 
attgtaactt 
cttgtggaat 
gattgaagct 
attttaggcc 
agtctaattt 
ctgaggaaac 
acagggggac 



tgctctgggg 
gcggctcggg 
agctgcaggg 
gctacgaccc 
tgagcgccag 
gtctgcccct 
ctacagcgac 
tgccgccaat 
gctgagaact 
tcgcacagga 
gactgaaaac 
gatgtttttt 
aatcattgct 
cctggcacaa 
ttttgggaaa 
gttagcaagg 
aaacctgatc 
gaccgtgggt 
aggtctgagc 
ggagctcctg 
aagcttccag 
ggtgcccata 
tggcccaagt 
tgcttctgga 
gagatccaaa 
gaacattgct 
ggctgaagtg 
ggttggagaa 
tgctctgcta 
cgaaaatgag 
agttattgcc 
aggaaaaatt 
cagaaaacta 
aatatgagac 
ataaaccata 
tgttttgaaa 
tttaaatgtc 
attttaatta 
attgtcaaat 
agtttaccaa 
taccgatcga 
ctcttttttt 
acggaaatgt 



cgttggagcc 
gggccacgcg 
gaggaggcct 
atggcgcccg 
aggctggcag 
ttctttctgg 
aacctgaccc 
gccattcgtg 
tcattattct 
gaattgatta 
ctctcagatg 
gtctcaccta 
gtaatttatg 
gccactcagc 
gaaatgactg 
aaagaggcat 
gtgctttctg 
gaactctctt 
tctttctact 
gagagagagc 
ggtgctttgg 
tttcaggatt 
ggttctggca 
actattagtc 
attgggacag 
tatggtgctg 
gccaatacag 

aagggtgttc 

aagaatccca 
taccttgttc 
catcgtctgt 
actgaatatg 
atgaacaaac 
tttaatgcaa 
tatcaagtta 
cgtacctgtt 
tatagcactg 
atatagcatg 
gacaacttta 
tcactgtgta 
ataacgcgct 
acacccgcgg 
ccctcgaa 



gcgcgccgct 
ggcctcggac 
ggcggcgcgg 
gactctcgga 
ctgcggttgg 
ggaagatcat 
gcctctgcct 
tctacctcat 
cctccattct 
accgcctctc 
ggctcagggc 
atctggccac 
ggcgatatct 
tagctgagga 
aaatcgagaa 
tcgcccgggc 
tcctgtacaa 
ccttcctaat 
cggagctgat 
ccaagctgcc 
agtttaagaa 
tcagcctttc 
aatcaacagt 
ttgatggcca 
tgagtcagga 
atgacccttc 
tggccttcat 
tcctctcagg 
aaattcttct 
aagaagctct 
ccaccattaa 
gaaaacatga 
aaagttttat 
aacagtgttg 
tttgaaaaat 
ctcaagatct 
aagttatttt 
gcacctcatt 
aaaaggcaat 
attcttctgg 
tgtgaatttt 
agaagggagt 



ggaggagctg 
ggctcccggg 
gcgggcggcg 
ggccgggaag 
atttctcacg 
cgatgccatc 
agggctcagt 
gcaaacttca 
gaggcaggag 
atcagacact 
cggggcccag 
ctttgttttg 
acggaaactg 
acgtattgga 
atatgccagc 
tggtttcttt 
aggagggctg 
gtatgctttc 
gaaaggactg 
ttttaacgag 
cgtgcatttt 
cattccgtca 
gctttcactc 
tgacatccgt 
acccattttg 
ctctgtgacc 
ccggaatttc 
tgggcagaaa 
cctagatgaa 
agatcgactg 
gaatgctaat 
agagctgctt 
ttcagcataa 
cagaaaaaaa 
acctattttt 
ttttattcag 
caggttttgt 
ttcttttgcc 
tataaataaa 
tagtattcta 
atacctttat 
ttttttgccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2558 



<210> 35 
<211> 5065

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3200979CB1 

<400> 35 

atggttaaaa aagagataag cgtgcgtcaa 
cttaaaaaat ggagaataaa aagagagttt 
cacttcagag ctacccgttt tcctgaacaa 
tttaatgact ctggcctggt agtggcatat 
atgaataaga tggccttggc ttcctttatg 
gaagagacca tggatataga acttccaaaa 
agtgatactt tctcatatcg cctgaagttt 
gagcactctg aatacacaga acactgttgg 
gcaaagtact ggctaaaagg gtttgtagct 
gaagtcacaa caaatcattc tgtaatggag 
aagataccac ctttcatttc taagggagaa 
ttagtttctt tctcttcttt tatatacttt 
aaatttaaga aactgatgac agtgatgggt 
ggattgacat acatttgctt catcttcatt 
tcaatcccaa ttgtatttca tactggcttc 
ggcctttctt tgatagcatt ggctttcctc 
gctggtttgg ctggatttct cttcactgta 
tatagacaac ttcctttatc tttgggatgg 
actgctggaa tggcccagat tacacacctg 
gatccctctg gggattcata caaaatgata 
cttttctatt tgatattcac attatatttt 
ggggattctc cattattttt ccttaagtcc 
catgaaatct ttgagaatga aataaatcct 
gtgtctccag aattccatgg aaaagaagcc 
aatggaaaga ctggaaaagt agaagcattg 
cagatcactg caatacttgg gcataatgga 
agtggattgt ctgtttctac agaaggatca 
ataactgaca tggaagaaat tagaaagaat 
tttgacttcc tcactgtgag agaaaacctc 
ccaaaggaag tggaacaaga ggttttgctg 
ttttcaagac accgagtgtg gagcctcctg 
ttcagtaccc aattcatgga tgaggctgac 
aatgggaagt tgaaatgtgc aggatcatct 
tatcatttaa gtttacacag gaatgaaatg 
aagcagcaca ttcctgatgc caagttaaca 
ttgcctttgg aaaaaacgaa caaatttcca 
gaccagggca taaggaatta tgctgtttca 
ctagaaggaa aatcagcaat tgatgaacca 
catgtgacaa gaaatactgg agatgagtct 
gaaacaagaa aggctgtcag tagtgcagct 
acacttcgct tcttaaagtt aaggcgtgaa 
cttggaattg cttttatccc catcattcta 
actcattgtt gggagttttc acccagtatg 
acgcctctta ccagcctgtt aatcgttaat 
cattcactga agtgtcagga tatagttttg 
tcagatgatc cctcctacaa tggagccatc 
ttttcagttg catgtaatac caagaaatcg 
agcaatgccc ttattggaat ttttaacttc 
ttttttcgtg atgacatagt gctggatctt 
ttgatcacaa actgcatttc tccttatatt 
ccttcctcta tcccttctat tctttgtcag 
ggcctctggc cttcagcata ctggtgtgga 
ttgattctcc tttcaataca tttaatttac 
tgggaactca tgtttgtttt ggtggtatgc 



caaattcagg ctcttctgta caagaatttt 60 
attgggctat atttgtgcat cttttcggaa 120 
cctcctaaag tcctgggaag cgtggatcag 180 
acaccagtca gtaacataac acaaaggata 240 
aaaggaagaa cagtcattgg gacaccagat 300 
aaataccatg aaatggtggg agttatattt 360 
aattggggat atagaatccc agttataaag 420 
gccatgcatg gtgaaatttt ttgttacttg 480 
tttcaagctg caattaatgc tgcaattata 540 
gagttgacat cagttattgg aataaatatg 600 
attatgaatg aatggtttca ttttacttgc 660 
gcatcattaa atgttgcaag ggaaagagga 720 
ctccgagagt cagcattctg gctctcctgg 780 
atgtccattt ttatggctct ggtcataaca 840 
atggtgatat tcacactcta tagcttatat 900 
atgagtgttt taataaggaa acctatgctc 960 
ttttggggat gtctgggatt cactgtgtta 1020 
gtattaagtc ttcttagccc ttttgccttc 1080 
gataattact taagtggtgt tatttttcct 1140 
gccacttttt tcattttggc atttgatact 1200 
gagcgagttt tacctgataa agatggccat 1260 
tcattttggt ccaaacatca aaatactcat 1320 
gagcattcct ctgatgattc ttttgaaccg 1380 
ataagaatca gaaatgttat aaaagaatat 1440 
caaggcatat tttttgacat atatgaagga 1500 
gctggtaaat caacactgct aaacattctt 1560 
gccactattt ataatactca actctctgaa 1620 
attggatttt gtccacagtt caattttcaa 1680 
agggtatttg ctaaaataaa agggattcag 1740 
ctagatgaac caactgctgg attggatccc 1800
aaggagcata aagtagaccg acttatcctc 1860 
atcttggctg ataggaaagt atttctgtct 1920 
ttgtttctga agcgaaagtg gggtattgga 1980 
tgtgacacag aaaaaatcac atcccttatt 2040 
acagaaagtg aagaaaaact tgtatatagt 2100 
gatctttaca gtgaccttga taagtgttct 2160 
gtgacatctc tgaatgaagt attcttgaac 2220 
gattttgaca ttgggaaaca agagaaaata 2280 
gaaatggaac aggttctttg ttctcttcct 2340 
ctctggagac gacaaatcta tgcagtggca 2400 
aggagagctc ttttgtgttt gttactagta 2460 
gagaagataa tgtataaagt aactcgtgaa 2520 
tatttccttt ctctggaaca aatcccgaag 2580 
aatacaggat caaatattga agacctcgtg 2640 
gaaatagatg actttagaaa cagaaatggc 2700 
atagtgtctg gtgaccagaa ggattacaga 2760 
aattgttttc ctgttcttat gggaattgtt 2820 
acagagctta ttcaaatgga gagcacttca 2880 
ggttttatag atgggtccat atttttgttg 2940 
ggcataagca gcatcagtga ttataaaatc 3000 
aaaaatgttc aatcccagtt atggatttca 3060 
caggctctgg tggacattcc attacacttc 3120 
tacttctcat ttctgggatt ccagcttcca 3180 
ataattggtt gtgcagcttc tcttatattc 3240 
ctcatgtacg tgctttcatt catcttttgc aagtggagaa aaaataatgg cttttggtct 3300
tttggctttt ttattgtctt aatatgtgta tccacaattc tggtatcaac taagtatgaa 3360 
aaacccaact taattttgtg catgatattc ataccttcct ttactttcct agatatgtcg 3420 
ttattgatcc agctcaactt tatgtatatg agaaacttgg acagtctgga caatagaata 3480 
aatgaagtca ataaaaccat tcttttaaca aacttaatac cataccttca gagtgttatt 3540 
ttcctttttg tcataaggtg tctggaaatg aagtatggaa atgaaatcat gaataaagac 3600 
ccagttttca gaatctctcc acgaagtaga ggaactcata ccaatccaga agagcctgaa 3660 
gaagatgttc aagctgaaag agtccaagca gcaaatgcac tcactactcc aaacttggag 3720 
gaggaaccag tcataactgc aagctgttta cacaaggaat attatgagac aaagaaaagt 3780 
tgcttttcaa caacaaagaa gaaagcagcc atcagaaatg tttcgttttg tgttaaaaaa 3840 
ggtgaagttt tgggattact aggacacaat ggagctggca aaagtacttc cattaaaatg 3900 
ataactgggt gcacagtgcc aactgcagga gtggtggtgt tacaaggcaa cagagcatca 3960 
gtaaggcaac agcgtgacaa cagcctcaag ttcttggggt actgccctca ggagaactca 4020 
ctgtggccca agcttacaat gaaagagcac ttggagttgt atgcagctgt gaaaggactg 4080 
ggcaaagaag atgctgctct cagtatttca cgattggtgg aagctcttaa gctccaggaa 4140 
caacttaagg ctcctgtgaa aactctatca gagggaataa agagaaagct gtgctttgtg 4200 
ctgagcatcc tggggaaccc atcagtggtg cttctagatg agccgttcac cgggatggac 4260 
cccgaggggc agcagcaaat gtggcagata cttcaggcta ccattaaaaa ccaggagagg 4320 
ggcaccctct tgaccaccca ttacatgtca gaggctaagt ctctgtgtga ccgtgtggcc 4380 
atcatggtgt caggaacgct aaggtgtatt ggttccattc aacatctgaa aaacaagttt 4440 
ggtaaagatt atttactaga aataaaaatg aaagaaccta ctcaggtgga agctctccac 4500 
acagagattt tgaagctttt cccacaggct gcttggcagg aaagatattc ctctttaatg 4560 
gcgtataagt tacctgtgga ggatgtccac cctctatctc gggccttttt caagttagag 4620 
gcgatgaaac agaccttcaa cctggaggaa tacagcctct ctcaggctac cttggagcag 4680 
gtattcttag aactctgtaa agagcaggag ctgggaaatg ttgatgataa aattgataca 4740
acagttgaat ggaaacttct cccacaggaa gacccttaaa atgaagaacc tcctaacatt 4800 
caattttagg tcctactaca ttgttagttt ccataattct acaagaatgt ttccttttac 4860 
ttcagttaac aaaagaaaat attcaatagt ttaaacatgc aacaatgatt acagttttca 4920 
tttttaaaaa tttaggatga aggaaaaagg aaatataggg aaaagtagta gacaaaatta 4980 
acaaaatcag acatgttatt atccccaaca tgggtctatt ttgtgcttag gggatccact 5040 
agtttagaac gccggaccgc gtggt 5065 

<210> 36 

<211> 1677 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6754139CB1 

<400> 36 

gtggaaacag gtttgagaga tactggaggg ggcagagcag tggggtttag aatccctggg 60 
tgaaagtctg gactcttgtg gcttatttgg gcccctctag catttgtgga gaggcaggca 120 
gactccaggt ccttgaaaag gggagggtgg aggagaaatt tgtcagcctg gcgccagaag 180 
atagtaccag ttcactccat ggcctttacc tcatgtgtcc ctgcaggcag gccagggagg 240 
aactagagcc acagctagag caagagaagg cagacaccag gaggacactc ataaggacag 300 
ggccccagcc ctgggagtgg agggtgtgag cagaggccct gggactaggg cctgggatgg 360 
acaaccctcc ttactgaccc tccagagtgc ctgggagctg agggccggct ggctctcaag 420 
ctgttccgtg acctctttgc caactacaca agtgccctga gacctgtggc agacacagac 480 
cagactctga atgtgaccct ggaggtgaca ctgtcccaga tcatcgacat ggatgaacgg 540 
aaccaggtgc tgaccctgta tctgtggata cggcaggagt ggacagatgc ctacctacga 600 
tgggacccca atgcctatgg tggcctggat gccatccgca tccccagcag tcttgtgtgg 660 
cggccagaca tcgtactcta taacaaagcc gacgcgcagc ctccaggttc cgccagcacc 720 
aacgtggtcc tgcgccacga tggcgccgtg cgctgggacg cgccggccat cacgcgcagc 780 
tcgtgccgcg tggatgtagc agccttcccg ttcgacgccc agcactgcgg cctgacgttc 840 
ggctcctgga ctcacggcgg gcaccaagtg gatgtgcggc cgcgcggcgc tgcagccagc 900 
ctggcggact tcgtggagaa cgtggagtgg cgcgtgctgg gcatgccggc gcggcggcgc 960 
gtgctcacct acggctgctg ctccgagccc taccccgacg tcaccttcac gctgctgctg 1020 
cgccgccgcg ccgccgccta cgtgtgcaac ctgctgctgc cctgcgtgct catctcgctg 1080 
cttgcgccgc tcgccttcca cctgcctgcc gactcaggcg agaaggtgtc gctgggcgtc 1140 
accgtgctgc tggcgctcac cgtcttccag ttgctgctgg ccgagagcat gccaccggcc 1200 
gagagcgtgc cgctcatcgg gaagtactac atggccacta tgaccatggt cacattctca 1260 
acagcactca ccatccttat catgaacctg cattactgtg gtcccagtgt ccgcccagtg 1320
ccagcctggg ctagggccct cctgctggga cacctggcac ggggcctgtg cgtgcgggaa 1380
agaggggagc cctgtgggca gtccaggcca cctgagttat ctcctagccc ccagtcgcct 1440
gaaggagggg ctggcccccc agcgggccct tgccacgagc cacgatgtct gtgccgccag 1500
gaagccctac tgcaccacgt agccaccatt gccaatacct tccgcagcca ccgagctgcc 1560
cagcgctgcc atgaggactg gaagcgcctg gcccgtgtga tggaccgctt cttcctggcc 1620
atcttcttct ccatggccct ggtcatgagc ctcctggtgc tggtgcaggc cctgtga 1677



<210> 37 

<211> 3714 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6996659CB1 



<400> 37 

atgaggagac tgagtttgtg gtggctgctg 
tgcgcactgg tgctggccgg ggtgcccagc 
ctcaagcgca tcgggcacgc ggtgagggtg 
gccccccgcg cggccagccg cgctccggac 
ccggagccag ggactaggcg gtccccggcg 
accctgcatg gccgggggcc gccgggctcc 
gccctgtggc cacgggacgc cctcctattt 
ctgctaccct acaacctgtc tttggaagta 
ctgccacttt tgcccttctc ctcccctagt 
ctgcaaagtg tgtgccatac cgtggtggtg 
cagagccagg gcgaaatgat ggagctcgac 
atcagcatcg tgcgccacga gtttccgcgg 
agtttagaaa attcattaag ttctgatgct 
aactggtaca attttagctt gttgctgtgc 
ctccttaccc agaataattc caagttccac 
ctcccctcca cccaggacct cttgagcttc 
agcacaccca cagtggtgat gtttggctgc 
attacaaccc agtttggggt catgccccct 
aatgtggagg aactgaggac agagggtctg 
acacagtctg tctttgagca ctacgtacaa 
gccacagcca ccatgatcca accagaactt 
gaggtggaaa ctacaaatct cacttcagga 
actttcagag gcctcagtgg ttccatcaga 
aacaactttt tcatctggaa tcttcaacat 
ttgggcagct ggcagggggg aaagattgtc 
cagagacaca aaacccactt ccaacatcca 
attgagcatc cttttgtctt cacaagggag 
caactctgtc tagaccccat gactaatgac 
ctccatagca gtaatgatac agtgcccatt 
attgatctgc tggaaaagat agcagaagac 
ggggatggaa agtatggagc atggaaaaat 
ctgagaggga ctgcccacat ggcagtcact 
gtgatagatt tcaccagccc tttcttctcc 
gatacagcag ctcccattgg agccttcatg 
atttttgtgg ctctgcacat cactgccgtc 
tttggtttga cttccaaggg gcgaaataga 
aacatctgtt atgccctctt gtttggcaga 
actggaaggt ttctaatgaa cctttgggcc 
acggcaaact tggctgctgt catggtaggt 
catgacccca agttacatca tccttcccaa 
agtgctgaag attatgtgag acaaagtttc 
aatgttccag ccacccctga tggagtggag 
gccttcatca tggacaaagc ccttctggat 
cttctcactg tggggaagcc atttgccata 
tctccattga ccgccaacat atccgagcta 



agcagggtct gtctgctgtt gccgccgccc 60 
tcctcctcgc acccgcagcc ctgccagatc 120 
ggcgcggtgc acttgcagcc ctggaccacc 180 
gacagccgag caggagccca gagggatgag 240 
ccctcgccgg gcgcacgctg gttggggagc 300 
cgtaagcccg gggagggcgc cagggcggag 360 
gccgtggaca acctgaaccg cgtggaaggg 420 
gtgatggcca tcgaggcagg cctgggcgat 480 
tcgccatgga gcagtgaccc tttctccttc 540 
caaggggtgt cggcgctgct cgccttcccc 600 
ttggtcagct tagtcctgca cattccagtg 660 
gagagtcaga atccccttca cctacaactg 720 
gatgtcactg tctcaatcct gaccatgaac 780 
caggaagact ggaacatcac cgatttcctc 840 
cttggttcta tcatcaacat caccgctaac 900 
ctacagatcc agcttgagag tattaagaac 960 
gacatggaaa gtatccggcg gattttcgaa 1020 
gaacttcgtt gggtgctggg agattcccag 1080 
cccttaggac tcattgctca tggaaaaaca 1140 
gatgctatgg agctggtcgc aagagctgta 1200 
gctctcattc ccagcacgat gaactgcatg 1260 
caatatttat caaggtttct agccaatacc 1320 
gtaaaaggtt ccaccatcgt cagctcagaa 1380 
gaccccatgg gaaagccaat gtggacccgc 1440 
atggactatg gaatatggcc agagcaggcc 1500 
agtaagctac acttgagagt ggttaccctg 1560 
gtagatgatg aaggcttgtg ccctgctggc 1620 
tcttccacat tggacagcct ttttagcagc 1680 
aaattcaaga agtgctgcta tggatattgc 1740 
atgaactttg acttcgacct ctatattgta 1800 
gggcactgga ctgggctagt gggtgatctc 1860 
tcctttagca tcaatactgc acggagccag 1920 
accagcttgg gcatcttagt gaggacccga 1980 
tggccactcc actggacaat gtggctgggg 2040 
ttcctcactc tgtatgaatg gaagagtcca 2100 
agtaaagtct tctccttttc ttcagccttg 2160 
acagtggcca tcaaacctcc aaaatgttgg 2220 
attttctgta tgttttgcct ttccacatac 2280 
gagaagatct atgaagagct ttctggaata 2340 
ggattccgct ttggaactgt ccgagaaagc 2400 
ccagagatgc atgaatatat gagaaggtac 2460 
tatctgaaga atgatccaga gaaactagac 2520 
tatgaagtgt caatagatgc tgactgcaaa 2580 
gaaggttacg gcattggcct cccacccaac 2640 
atcagtcaat acaagtcaca tgggtttatg 2700 
gatatgctcc atgacaagtg gtacagggtg gttccctgtg gcaagagaag ttttgctgtc 2760
acggagactt tgcaaatggg catcaaacac ttctctgggc tctttgtgct gctgtgcatt 2820 
ggatttggtc tgtccatttt gaccaccatt ggtgagcaca tagtatacag gctgctgcta 2880 
ccacgaatca aaaacaaatc caagctgcaa tactggctcc acaccagcca gagattacac 2940 
agagcaataa atacatcatt tatagaggaa aagcagcagc atttcaagac caaacgtgtg 3000 
gaaaagaggt ctaatgtggg accccgtcag cttaccgtat ggaatacttc caatctgagt 3060 
catgacaacc gacggaaata catctttagt gatgaggaag gacaaaacca gctaggcatc 3120 
cggatccacc aggacatccc cctccctcca aggagaagag agctccctgc cttgcggacc 3180 
accaatggga aagcagactc cctaaatgta tctcggaact cagtgatgca ggaactctca 3240 
gagctcgaga agcagattca ggtgatccgt caggagctgc agctggctgt gagcaggaaa 3300 
acggagctgg aggagtatca aaggacaagt cggacttgtg agtcctaggt gaccacactg 3360 
cttccctttc tcagttcctg accttcctct gagcccttga gacactttgt aatgctcttt 3420 
tgtaactatc gacaaaggtg tggggaagct gaggtctagg tcttcttaaa ggtcaagtct 3480 
gctctccctc gcctaaagtg cagcagcagc tcctctcaag ctcactctct aggtctccag 3540 
ggtaggagtg tttttctagc aagaatctta gtcaggagta agctctgtgc gagagatctg 3600 
tgaataacca gataacccca gctgccgtta accttttcac caggtgccac agtaatattt 3660 
ctggttttta gccctttctc tgcactacca acaagagata aaattgttac tcac 3714 

<210> 38 

<211> 1009 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> misc_feature 
<223> Incyte ID No: 7472747CB1 

<400> 38 

cgaggaggag ggcagggcag gcggcagcct gggcacaggc ccctaggtgc ttactcctca 60 
cctgtttccc acctctcccc catagccagc cccacggccc tggcagggtc ctggccacag 120 
catgcccagt gctgggctct gcagctgctg gggtggccgg gtgctgcccc tgctgctggc 180 
ctatgtctgc tacctgctgc tcggtgccac tatcttccag ctgctagaga ggcaggcgga 240 
ggctcagtcc agggaccagt ttcagttgga gaagctgcgc ttcctggaga actacacctg 300 
cctggaccag tgggccatgg agcagtttgt gcaggtcatc atggaagcct gggtgaaagg 360 
tgtgaacccc aaaggcaact ctaccaaccc cagcaactgg gactttggca gcagtttctt 420 
ctttgcaggc acagtcgtca ctaccatagg ttatgggaac ctggcaccca gcacagaggc 480 
aggtcaggtc ttctgtgtct tctatgccct gttgggcatc ccgcttaacg tgatcttcct 540 
caaccacctg ggcacagggc tgcgtgccca tctggccgcc attgaaagat gggaggaccg 600 
tcccaggcgc tcccaggagg tactgcaagt cctgggcctg gctctgttcc tgaccctggg 660 
gacgctggtc attctcatct tcccacccat ggtcttcagc catgtggagg gctggagctt 720 
cagcgagggc ttctactttg ctttcatcac tctcagcacc attggctttg gggactatgt 780 
tgcaggcaca gaccccagca agcattatat ctcagtgtat cggagcctgg cagccatctg 840 
gatcctcctg ggcctggcgt ggctggcgct gatcctccca ctgggccccc tgcttctgca 900 
cagatgctgc cagctctggc tgctcagtag gggcctcggc gtcaaggatg gggcagcctc 960 
tgaccccagt gggctcccca ggcctcagaa gatccccatc tctgcatga 1009 

<210> 39 
<211> 1155 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> misc_feature 
<223> Incyte ID No: 7474121CB1 

<400> 39 

atggaggtct cggggcaccc ccaggccagg agatgctgcc cagaggccct gggaaagctc 60 
ttccctggcc tctgcttcct ctgctttctg gtgacctacg ccctggtggg tgctgtggtc 120 
ttctctgcca ttgaggacgg ccaggtcctg gtggcagcag atgatggaga gtttgagaag 180 
ttcttggagg agctctgcag aatcttgaac tgcagtgaaa cagtggtgga agacagaaaa 240 
caggatctcc aggggcatct gcagaaggtg aagcctcagt ggtttaacag gaccacacac 300 
tggtccttcc tgagctcgct ctttttctgc tgcacggtgt tcagcaccgt gggctatggc 360 
tacatctacc ccgtcaccag gcttggcaag tacttgtgca tgctctatgc tctctttggt 420 
atccccctga tgttcctcgt tctcacggac acaggcgaca tcctggcaac catcttatct 480 
acatcttata atcggttccg aaaattccct ttctttaccc gccccctcct ctccaagtgg 540 
tgccccaaat ctctcttcaa gaaaaaaccg gaccccaagc ccgcagatga agctgtccct 600 
cagatcatca tcagtgctga agagcttcca ggccccaaac ttggcacatg tccttcacgc 660 
ccaagctgca gcatggagct gtttgagaga tctcatgcgc tagagaaaca gaacacactg 720 
caactgcccc cacaagccat ggagaggagt aactcgtgtc ccgaactggt gttgggaaga 780 
ctctcatact ccatcatcag caacctggat gaagttggac agcaggtgga gaggttggac 840 
atccccctcc ccatcattgc ccttattgtt tttgcctaca tttcctgtgc agctgccatc 900 
ctccccttct gggagacaca gttggatttc gagaatgcct tctatttctg ctttgtcaca 960 
ctcaccacca ttgggtttgg ggatactgtt ttagaacacc ctaacttctt cctgttcttc 1020 
tccatttata tcatcgttgg aatggagatt gtgttcattg ctttcaagtt ggtgcaaaac 1080 
aggctgattg acatatacaa aaatgttatg ctattctttg caaaagggaa gttttaccac 1140 
cttgttaaaa agtga 1155 

<210> 40 
<211> 2733 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> misc_feature 
<223> Incyte ID No: 7475615CB1 



<400> 40 

cccttcattg agctccttca ccagcaccag gaaggcaccg ctgatgagag cgaagatgag 60 
cgaggcgatg ttggtgtggg ggaggttttt gcaaatgtca atgaaggtaa agacgatgga 120 
ccctgggcct gtgtaggagg ggatggtcag tccgaagatg tacttgagca ccgaaatcag 180 
gaacaccttt cggctgcccg ctccccagac acatctgcag ccctgcccag ccggctttgc 240 
tcacccactg cttgtaaatg ccccagatat gagccagccc aggccccgct acgtggtaga 300 
cagagccgca tactccctta ccctcttcga cgatgagttt gagaagaagg accggacata 360 
cccagtggga gagaaacttc gcaatgcctt cagatgttcc tcagccaaga tcaaagctgt 420 
ggtgtttggg ctgctgcctg tgctctcctg gctccccaag tacaagatta aagactacat 480 
cattcctgac ctgctcggtg gactcagcgg gggatccatc caggtcccac aaggcatggc 540 
atttgctctg ctggccaacc ttcctgcagt caatggcctc tactcctcct tcttccccct 600 
cctgacctac ttcttcctgg ggggtgttca ccagatggtg ccaggtacct ttgccgttat 660 
cagcatcctg gtgggtaaca tctgtctgca gctggcccca gagtcgaaat tccaggtctt 720 
caacaatgcc accaatgaga gctatgtgga cacagcagcc atggaggctg agaggctgca 780 
cgtgtcagct acgctagcct gcctcaccgc catcatccag atgggtctgg gcttcatgca 840 
gtttggcttt gtggccatct acctctccga gtccttcatc cggggcttca tgacggccgc 900 
cggcctgcag atcctgattt cggtgctcaa gtacatcttc ggactgacca tcccctccta 960 
cacaggccca gggtccatcg tctttacctt cattgacatt tgcaaaaacc tcccccacac 1020 
caacatcgcc tcgctcatct tcgctctcat cagcggtgcc ttcctggtgc tggtgaagga 1080 
gctcaatgct cgctacatgc acaagattcg cttccccatc cctacagaga tgattgtggt 1140 
ggtggtggca acagctatct ccgggggctg taagatgccc aaaaagtatc acatgcagat 1200 
cgtgggagaa atccaacgcg ggttccccac cccggtgtcg cctgtggtct cacagtggaa 1260 
ggacatgata ggcacagcct tctccctagc catcgtgagc tacgtcatca acctggctat 1320 
gggccggacc ctggccaaca agcacggcta cgacgtggat tcgaaccagg agatgatcgc 1380 
tctcggctgc agcaacttct ttggctcctt ctttaaaatt catgtcattt gctgtgcgct 1440 
ttctgtcact ctggctgtgg atggagctgg aggaaaatcc cagtctgtgc taggagccct 1500 
gatcgctgtc aatctcaaga actccctcaa gcaactcacc gacccctact acctgtggag 1560 
gaagagcaag ctggactgtt gcatctgggt agtgagcttc ctctcctcct tcttcctcag 1620 
cctgccctat ggtgtggcag tgggtgtcgc cttctccgtc ctggtcgtgg tcttccagac 1680 
tcagtttcga aatggctatg cactggccca ggtcatggac actgacattt atgtgaatcc 1740 
caagacctat aatagggccc aggatatcca ggggattaaa atcatcacgt actgctcccc 1800 
tctctacttt gccaactcag agatcttcag gcaaaaggtc atcgccaaga ctgtctccct 1860 
gcaggagctg cagcaggact ttgagaatgc gccccccacc gaccccaaca acaaccagac 1920 
cccggctaac ggcaccagcg tgtcctatat caccttcagc cctgacagct cctcacctgc 1980 
ccagagtgag ccaccagcct ccgctgaggc ccccggcgag cccagtgaca tgctggccag 2040 
cgtcccaccc ttcgtcacct tccacaccct catcctggac atgagtggag tcagcttcgt 2100 
ggacttgatg ggcatcaagg ccctggccaa gctgagctcc acctatggga agatcggcgt 2160 
gaaggtcttc ttggtgaaca tccatgccca ggtgtacaat gacattagcc atggaggcgt 2220 
ctttgaggat gggagtctag aatgcaagca cgtctttccc agcatacatg acgcagtcct 2280 
ctttgcccag gcaaatgcta gagacgtgac cccaggacac aacttccaag gggctccagg 2340 
ggatgctgag ctctccttgt acgactcaga ggaggacatt cgcagctact gggacttaga 2400 
gcaggagatg ttcgggagca tgtttcacgc agagaccctg accgccctgt gagggctcag 2460 
ccagtcctca tgctgcctac agagtgcctg gcacttggga cttccataaa ggatgagcct 2520 
ggggtcacag ggggtgtcgg gcggaggaaa gtgcatcccc cagagcttgg gttcctctct 2580 
cctctccccc tctctcctcc cttccttccc tccccgcatc tccagagaga gcctctcagc 2640 
agcagggggg tgctaccctt acaggagtga gagtctggtg agcccactct tcacccgtca 2700 
ggcctggccg caatggacaa gcctcctgct cac 2733 



<210> 41 

<211> 3457 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7475656CB1 



<400> 41 

cgagtctgga gcccgcgccg tcgccggccg cgtcctccgg gcatggaagg aggcggcaag 60 
cccaactctt cgtctaacag ccgggacgat ggcaacagcg tcttccccgc caaggcgtcc 120 
gcgccgggcg cggggccggc cgcggccgag aagcgcctgg gcaccccgcc ggggggcggc 180 
ggggccggcg cgaaggagca cggcaactcc gtgtgcttca aggtggacgg cggtggcggc 240 
gaggagccgg cggggggctt cgaagacgcc gaggggcccc ggcggcagta cggcttcatg 300 
cagaggcagt tcacctccat gctgcagccc ggggtcaaca aattctccct ccgcatgttt 360 
gggagccaga aggcggtgga aaaggagcag gaaagggtta aaactgcagg cttctggatt 420 
atccaccctt acagtgattt caggttttac tgggatttaa taatgcttat aatgatggtt 480 
ggaaatctag tcatcatacc agttggaatc acattcttta cagagcaaac aacaacacca 540 
tggattattt tcaatgtggc atcagataca gttttcctat tggacctgat catgaatttt 600 
aggactggga ctgtcaatga agacagttct gaaatcatcc tggaccccaa agtgatcaag 660 
atgaattatt taaaaagctg gtttgtggtt gacttcatct catccatccc agtggattat 720 
atctttctta ttgtagaaaa aggaatggat tctgaagttt acaagacagc cagggcactt 780 
cgcattgtga ggtttacaaa aattctcagt ctcttgcgtt tattacgact ttcaaggtta 840 
attagataca tacatcaatg ggaagagata ttccacatga catatgatct cgccagtgca 900 
gtggtgagaa tttttaatct catcggcatg atgctgctcc tgtgccactg ggatggttgt 960 
cttcagttct tagtaccact actgcaggac ttcccaccag attgctgggt gtctttaaat 1020 
gaaatggtta atgattcttg gggaaagcag tattcatacg cactcttcaa agctatgagt 1080 
cacatgctgt gcattgggta tggagcccaa gccccagtca gcatgtctga cctctggatt 1140 
accatgctga gcatgatcgt cggggccacc tgctatgcca tgtttgtcgg ccatgccacc 1200 
gctttaatcc agtctctgga ttcttcgagg cggcagtatc aagagaagta taagcaagtg 1260 
gaacaataca tgtcattcca taagttacca gctgatatgc gtcagaagat acatgattac 1320 
tatgaacaca gataccaagg caaaatcttt gatgaggaaa atattctcaa tgaactcaat 1380 
gatcctctga gagaggagat agtcaacttc aactgtcgga aactggtggc tacaatgcct 1440 
ttatttgcta atgcggatcc taattttgtg actgccatgc tgagcaagtt gagatttgag 1500 
gtgtttcaac ctggagatta tatcatacga gaaggagccg tgggtaaaaa aatgtatttc 1560 
attcaacacg gtgttgctgg tgtcattaca aaatccagta aagaaatgaa gctgacagat 1620 
ggctcttact ttggagagat ttgcctgctg accaaaggac gtcgtactgc cagtgttcga 1680 
gctgatacat attgtcgtct ttactcactt tccgtggaca atttcaacga ggtcctggag 1740 
gaatatccaa tgatgaggag agcctttgag acagttgcca ttgaccgact agatcgaata 1800 
ggaaagaaaa attcaattct tctgcaaaag ttccagaagg atctgaacac tggtgttttc 1860 
aacaatcagg agaacgaaat cctcaagcag attgtgaaac atgacaggga gatggtgcag 1920 
gcaatcgctc ccatcaatta tcctcaaatg acaaccctga attccacatc gtctactacg 1980 
accccgacct cccgcatgag gacacaatct ccaccggtgt acacagcgac cagcctgtct 2040 
cacagcaacc tgcactcccc cagtcccagc acacagaccc cccagccatc agccatcctg 2100 
tcaccctgct cctacaccac cgcggtctgc agccctcctg tacagagccc tctggccgct 2160 
cgaactttcc actatgcctc ccccaccgcc tcccagctgt cactcatgca acagcagccg 2220 
cagcagcagg tacagcagtc ccagccgccg cagactcagc cacagcagcc gtccccgcag 2280 
ccacagacac ctggcagctc cacgccgaaa aatgaagtgc acaagagcac gcaggcgctt 2340 
cacaacacca acctgacccg ggaagtcagg ccactctccg cctcgcagcc ctcgctgccc 2400 
catgaggtgt ccactctgat ttccagacct catcccactg tgggcgagtc cctggcctcc 2460 
atccctcaac ccgtgacggc ggtccccgga acgggccttc aggcaggggg caggagcact 2520 
gtcccgcagc gcgtcaccct cttccgacag atgtcgtcgg gagccatccc cccgaaccga 2580 
ggagtccctc cagcaccccc tccaccagca gctgctcttc caagagaatc ttcctcagtc 2640 
ttaaacacag acccagacgc agaaaagcca cgatttgctt caaatttatg atccctgctg 2700 
attgtcaaag cagaaagaaa tactctcata aactgagact atactcagat cttattttat 2760 

tctatctcct gatagatccc tctagcctac tatgaagaga tattttagac agctgtggcc 2820 

tacacgtgaa atgtaaaaat atatatacat atactataaa atatatatct aaattcccaa 2880 

gagagggtca aaagacctgt ttagcattca gtgttatatg tcttcctttc tttaaatcat 2940 

taaaggattt aaaatgtcgt tgtaagatta tttatttcta acctactttt acttaagtcc 3000 

tttgatatgt atatttctct attttatgaa gagttcttgg attcaatgga aacaaaactg 3060 

attttaaaaa ggcaactcaa atgaactagt aaatagcacc aatcaaaact ttctttcatt 3120 

agctgtgtct ctgcatctaa attgttaatc attaatggtg gagaattaaa taacaaatcc 3180 

cattttatag atctaaattg tatttcggtg ctttcaattt caaattaggt taaagaatgc 3240 

actacttgct tggccaccgt aggagactag cattgccact gtttgttaag aatatcacta 3300 

acctcaaaca tgttcattga tctttcagaa agctgaggga aaattaatat ttgtcttcat 3360 

gtgttatcgg acttttacca agactcgatc aatgttagtt gtaaataact ttttcaaccc 3420 

aaataaaaat agctattctg tgttgtaaaa aaaaaaa 3457 

<210> 42 
<211> 5622 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7480632CB1 

<400> 42 

ctggacaagg agaaaaacat agggaaaaaa ccaacagaat ttgttggcat gttctacaca 60 
cagaccatgg cttttcagaa gccaagctga ataaaaacag ttttaaaaga ggcaaccatt 120 
tgtagaggag tccttgaagg attcttcatt gttttcttgg acaaaaagag accagtggat 180 
ccaagtgctt caaatacttc tctcttattt tcttaactct attgctctgc aatatttact 240 
ttaccctgtt aatgaacagg acaaaatggt taaaaaagag ataagcgtgc gtcaacaaat 300 
tcaggctctt ctgtacaaga attttcttaa aaaatggaga ataaaaagag agtttctgga 360 
ggaatggaca ataacattgt ttctagggct atatttgtgc atcttttcgg aacacttcag 420 
agctacccgt tttcctgaac aacctcctaa agtcctggga agcgtggatc agtttaatga 480 
ctctggcctg gtagtggcat atacaccagt cagtaacata acacaaagga taatgaataa 540 
gatggccttg gcttccttta tgaaaggaag aacagtcatt gggacaccag atgaagagac 600 
catggatata gaacttccaa aaaaatacca tgaaatggtg ggagttatat ttagtgatac 660 
tttctcatat cgcctgaagt ttaattgggg atatagaatc ccagttataa aggagcactc 720 
tgaatacaca ggtcactgtt gggccatgca tggtgaaatt ttttgttact tggcaaagta 780 
ctggctaaaa gggtttgtag cttttcaagc tgcaattaat gctgcaatta tagaagtaac 840 
aacaaatcat tctgtaatgg aggagttgac atcagttatt ggaataaata tgaagatacc 900 
acctttcatt tctaagggag aaattatgaa tgaatggttt cattttactt gcttagtttc 960 
tttctcttct tttatatact ttgcatcatt aaatgttgca agggaaagag gaaaatttaa 1020 
gaaactgatg acagtgatgg gtctccgaga gtcagcattc tggctctcct ggggattgac 1080 
atacatttgc ttcatcttca ttatgtccat ttttatggct ctggtcataa catcaatccc 1140 
aattgtattt catactggct tcatggtgat attcacactc tatagcttat atggcctttc 1200 
tttggtggca ttggctttcc tcatgagtgt tttaataagg aaacctatgc tcgctggttt 1260 
ggctggattt ctcttcactg tattttgggg atgtctggga ttcactgtgt tatatagaca 1320 
acttccttta tctttgggat gggtattaag tcttcttagc ccttttgcct tcactgctgg 1380 
aatggcccag attacacacc tggataatta cttaagtggt gttatttttc ctgatccctc 1440 
tggggattca tacaaaatga tagccacttt tttcattttg gcatttgata ctcttttcta 1500 
tttgatattc acattatatt ttgagcgagt tttacctggt aaggatggcc atggggattc 1560 
tccattattt ttccttaagt cctcattttg gtccaaacat caaaatactc atcatgaaat 1620 
ctttgagaat gaaataaatc ctgagcattc ctctgatgat tcttttgaac cggtgtctcc 1680 
agaattccat ggaaaagaag ccataagaat cagaaatgtt ataaaagaat ataatggaaa 1740 
gactggaaaa gtagaagcat tgcaaggcat attttttgac atatatgaag gacagatcac 1800 
tgcaatactt gggcataatg gagctggtaa atcaacactg ctaaacattc ttagtggatt 1860 
gtctgtttct acagaaggat cagccactat ttataatact caactctctg aaataactga 1920 
catggaagaa attagaaaga atattggatt ttgtccacag ttcaattttc aatttgactt 1980 
cctcactgtg agagaaaacc tcagggtatt tgctaaaata aaagggattc agccaaagga 2040 
agtggaacaa gaggtattgc tgctagatga accaactgct ggattggatc ccttttcaag 2100 
acaccgagtg tggagcctcc tgaaggagca taaagtagac cgacttatcc tcttcagtac 2160 
ccaattcatg gatgaggctg acatcttggc tgataggaaa gtatttctgt ctaatgggaa 2220 
gttgaaatgt gcaggatcat ctttgtttct gaagcgaaag tggggtattg gatatcattt 2280 
aagtttacac aggaatgaaa tgtgtgacac agaaaaaatc acatccctta ttaagcagca 2340 
cattcctgat gccaagttaa caacagaaag tgaagaaaaa cttgtatata gtttgccttt 2400 
ggaaaaaacg aacaaatttc cagatcttta cagtgacctt gataagtgtt ctgaccaggg 2460 
cataaggaat tatgctgttt cagtgacatc tctgaatgaa gtattcttga acctagaagg 2520 
aaaatcagca attgatgaac cagattttga cattgggaaa caagagaaaa tacatgtgac 2580 
aagaaatact ggagatgagt ctgaaatgga acaggttctt tgttctcttc ctgaaacaag 2640 
aaaggctgtc agtagtgcag ctctctggag acgacaaatc tatgcagtgg caacacttcg 2700 
cttcttaaag ttaaggcgtg aaaggagagc tcttttgtgt ttgttactag tacttggaat 2760 
tgcttttatc cccatcattc tagagaagat aatgtataaa gtaactcgtg aaactcattg 2820 
ttgggagttt tcacccagta tgtatttcct ttctctggaa caaatcccga agacgcctct 2880 
taccagcctg ttaatcgtta ataatacagg atcaaatatt gaagacctcg tgcattcact 2940 
gaagtgtcag gatatagttt tggaaataga tgactttaga aacagaaatg gctcagatga 3000 
tccctcctac aatggagcca tcatagtgtc tggtgaccag aaggattaca gattttctgt 3060 
tgcgtgtaat accaagaaat tgaattgttt tcctgttctt atgggaattg ttagcaatgc 3120 
ccttatggga atttttaact tcacggagct tattcaaatg gagagcactt catttttttt 3180 
ttacataacc acaaaatctt ttcaaactaa gatcccttcc tctatccctt ctattctttg 3240 
tcagaaaaat gttcaatccc agttatggat ttcaggcctc tggccttcag catactggtg 3300 
tggacaggct ctggtggaca ttccattata cttcttgatt ctcttttcaa tacatttaat 3360 
ttactacttc atatttctgg gattccagct ttcatgggaa ctcatgtttg ttttggtggt 3420 
atgcataatt ggttgtgcag tttctcttat attcctcaca tatgtgcttt cattcatctt 3480 
tcgcaagtgg agaaaaaata atggcttttg gtcttttggc ttttttattg taagtatata 3540 
tacagacttt agctttcatt acaatgtttc taggtgtgat tttctattta tctttatttt 3600 
tgtatgttta tttattgctc atcatttttc tttctgttct ccataccttc agagtgttat 3660 
tttccttttt gtcataaggt gtctggaaat gaagtatgga aatgaaataa tgaataaaga 3720 
cccagttttc agaatctctc cacggagtag agaaactcat cccaatccgg aagagcccga 3780 
agaagaagat gaagatgttc aagctgaaag agtccaagca gcaaatgcac tcactgctcc 3840 
aaacttggag gaggaaccag tcataactgc aagctgttta cacaaggaat attatgagac 3900 
aaagaaaagt tgcttttcaa caagaaagaa gaaaatagcc atcagaaatg tttccttttg 3960 
tgttaaaaaa ggtgaagttt tgggattact aggacacaat ggagctggta aaagtacttc 4020 
cattaaaatg ataactgggt gcacaaagcc aactgcagga gtggtggtgt tacaaggcag 4080 
cagagcatca gtaaggcaac agcatgacaa cagcctcaag ttcttggggt actgccctca 4140 
ggagaactca ctgtggccca agcttacaat gaaagagcac ttggagttgt atgcagctgt 4200 
gaaaggactg ggcaaagaag atgctgctct cagtatttca cgattggtgg aagctcttaa 4260 
gctccaggaa caacttaagg ctcctgtgaa aactctatca gagggaataa agagaaagct 4320 
gtgctttgtg ctgagcatcc tggggaaccc atcagtggtg cttctagatg agccgttcac 4380 
cgggatggac cccgaggggc agcagcaaat gtggcagata cttcaggcta ccgttaaaaa 4440 
caaggagagg ggcaccctct tgaccaccca ttacatgtca gaggctgagg ctgtgtgtga 4500 
ccgtatggcc atgatggtgt caggaacgct aaggtgtatt ggttccattc aacatctgaa 4560 
aaacaagttt ggtagagatt atttactaga aataaaaatg aaagaaccta cccaggtgga 4620 
agctctccac acagagattt tgaagctttt cccacaggct gcttggcagg aaagatattc 4680 
ctctttaatg gcgtataagt tacctgtgga ggatgtccac cctctatctc gggccttttt 4740 
caagttagag gcgatgaaac agaccttcaa cctggaggaa tacagcctct ctcaggctac 4800 
cttggagcag gtattcttag aactctgtaa agagcaggag ctgggaaatg ttgatgataa 4860 
aattgataca acagttgaat ggaaacttct cccacaggaa gacccttaaa atgaagaacc 4920 
tcctaacatt caattttagg tcctactaca ttgttagttt ccataattct acaagaatgt 4980 
ttccttttac ttcagttaac aaaagaaaac atttaataaa cattcaataa tgattacagt 5040 
tttcattttt aaaaatttag gatgaaggaa acaaggaaat atagggaaaa gtagtagaca 5100 
aaattaacaa aatcagacat gttattcatc cccaacatgg gtctattttg tgcttaaaaa 5160 
taatttaaaa atcatacaat attaggttgg ttatcggtta ttatcaataa agctaacact 5220 
gagaacattt tacaaataaa aatatgaggt ttttagcctg aacttcaaat gtatcagcta 5280 
tttttaaaca ttatttactc ggattctaat ttaatgtgac attgactata agaaggtctg 5340 
ataaactgat gaaatggcac agcataacat ttaattataa tgacattctg attataaaat 5400 
aaattgcatg tgaattttag tacatattga agttatatgg aagaagatag ccataatctg 5460 
taagaaagta ccgcagttta atattttctt tagccaactt atattcatgt atttttatgg 5520 
atctttttca aggtagtatc agtaggctag tcatttcgtt ctttcactca cgtcagaact 5580 
tccatgtttt tgcctggttt atgggtagtt aggttggtac ca 5622 

<210> 43 

<211> 2600 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 6952742CB1 



<400> 43 

cccgtccagt ggaccccctc acgatggatt ctggatccag cagggcccag cgtccatcca 60 
taccgggcag ggggctggga cccgcgctgc caggagaagg cccagcacca atccccggac 120 
ctgggtgggc gaggggtccg ccccaagggg cccgttgctg ccggggacct tgtcgtttgg 180 
ccctggatcc gggggctcct gtgaccatgc cctcttctcg gccgcaggtc ggccacggga 240 
cctgacgcaa caggatggac gagtcccctg agcctctgca gcagggcaga gggccggtgc 300 
cggtccgacg ccagcgccca gcaccccggg gtctgcgtga gatgctgaag gccaggctgt 360 
ggtgcagctg ctcgtgcagt gtgctgtgcg tccgggcgct ggtgcaggac ctgctccccg 420 
ccacgcgctg gctgcgtcag taccgcccgc gggagtacct ggcaggcgac gtcatgtctg 480 
ggctggtcat cggcatcatc ctggccatcg cctactcatt gctggccggg ctgcagccca 540 
tctacagcct ctatacgtcc ttcttcgcca acctcatcta cttcctcatg ggcacctcac 600 
ggcatgtctc cgtgggcatc ttcagcctgc tttgcctcat ggtggggcag gtggtggacc 660 
gggagctcca gctggccggc tttgacccct cccaggacgg cctgcagccc ggagccaaca 720 
gcagcaccct caacggctcg gctgccatgc tggactgcgg gcgtgactgc tacgccatcc 780 
gtgtcgccac cgccctcacg ctgatgaccg ggctttacca ggtcctcatg ggcgtcctcc 840 
ggctgggctt cgtgtccgcc tacctctcac agccactgct cgatggcttt gccatggggg 900 
cctccgtgac catcctgacc tcgcagctca aacacctgct gggcgtgcgg atcccgcggc 960 
accaggggcc cggcatggtg gtcctcacat ggctgagcct gctgcgcggc gccgggcagg 1020 
ccaacgtgtg cgacgtggtc accagcacgg tgtgcctggc ggtgctgcta gccgcgaagg 1080 
agctctcaga ccgctaccga caccgcctga gggtgccgct gcccacggag ctgctggtca 1140 
tcgtggtggc cacactcgtg tcgcacttcg ggcagctcca caagcgcttt ggctcgagcg 1200 
tggctggcga catccccacg ggtttcatgc cccctcaggt cccagagccc aggctgatgc 1260 
agcgtgtggc tttggatgcc gtggccctgg ccctcgtggc tgccgccttc tccatctcgc 1320 
tggcggagat gttcgcccgc agtcacggct actctgtgcg tgccaaccag gagctgctgg 1380 
ctgtgggctg ctgcaacgtg ctacccgcct tcctccactg cttcgccacc agcgccgccc 1440 
tggccaagag cctggtgaag acagccactg gctgccggac acagctgtcc agcgtggtca 1500 
gcgccaccgt ggtgctgctg gtgctgctgg cgctggcacc gctgttccac gacctacagc 1560 
gaagcgtgct ggcctgcgtc atcgtggtca gcctgcgggg ggccctgcgc aaggtgtggg 1620 
acctcccgcg gttgtggcgg atgagcccgg ctgacgcgct ggtctgggca ggcaccgtgg 1680 
ccacctgtat gctggtcagc acagaggccg ggctgctggc tggcgtcatc ctctcgctgc 1740 
tcagcctggc cggccgcacc caaagccacg gcaccgccct gctggcccgc atcggggaca 1800 
cggccttcta cgaggatgcc acagagttcg agggcctcgt ccctgagccc ggcgtgcggg 1860 
tgttccgctt tggggggccg ctgtactatg ccaacaagga cttcttcctg cagtcactct 1920 
acagcctcac ggggctggac gcagggtgca tggctgccag gaggaaggag gggggctcag 1980 
agacgggggt cggtgaggga ggccctgccc agggcgagga cctgggcccg gttagcacca 2040 
gggctgcgct ggtgcccgca gcggccggct tccacacagt ggtcatcgac tgcgccccgc 2100 
tgctgttcct agacgcagcc ggtgtgagca cgctgcagga cctgcgccga gactacgggg 2160 
ccctgggcat cagcctgctg ctagcctgct gcagcccgcc tgtgagagac attctgagca 2220 
gaggaggctt cctcggggag ggccccgggg acacggctga ggaggagcag ctgttcctca 2280 
gtgtgcacga tgccgtgcag acagcacgag cccgccacag ggagctggag gccaccgatg 2340 
cccatctgta gcagggccag gcctgcccag cagcctctgc tccctcctgg ggacccacag 2400 
cagacgtctg caagccactg ctgagaccct tcccagggag gagccaccca agagctgcac 2460 
tcttgtgcca cagctgccct ggggaaaccg gggaacccca actgggaaag gaggccctct 2520 
gatcacacgc aggacccaaa cactcagaaa tcaagaacct ctgcctccga gacaggctgg 2580 
cccacagtgc tggctgggcc 2600 



<210> 44 

<211> 2917 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7478795CB1 



<400> 44 

gcagcgggaa gagcggagcg aggaccgcgt ccggcgcagt cttcaatgag cagcgcggaa 60 
actgcacccc agacccgagc ctgctgcgcg ccccctccca gagctcacct ggtgccaggt 120 
aacaggcctg gcctcgccct gtggatgatg atggccttgc ccccgtgagc tacaacctgg 180 
ccttcagcac ccgcccacct ccaaccagca ggatgcggct gtggaaggcg gtggtggtga 240 
ctttggcctt catgagtgtg gacatctgcg tgaccacggc catctatgtc ttcagccacc 300 
tggaccgcag cctcctggag gacatccgcc acttcaacat ctttgactcg gtgctggatc 360 
tctgggcagc ctgcctgtac cgcagctgcc tgctgctggg agccaccatt ggtgtggcca 420 
agaacagtgc gctggggccc cggcggctgc gggcctcgtg gctggtcatc accctcgtgt 480 
gcctcttcgt gggcatctat gccatggtga agctgctgct cttctcagag gtgcgcaggc 540 
ccatccggga cccctggttt tgggccctgt tcgtgtggac gtacatttca ctcggcgcat 600 
ccttcctgct ctggtggctg ctgtccaccg tgcggccagg cacccaggcc ctggagccag 660 
gggcggccac cgaggctgag ggcttccctg ggagcggccg gccaccgccc gagcaggcgt 720 
ctggggccac gctgcagaag ctgctctcct acaccaagcc cgacgtggcc ttcctcgtgg 780 
ccgcctcctt cttcctcatc gtggcagctc tgggagagac cttcctgccc tactacacgg 840 
gccgcgccat tgatggcatc gtcatccaga aaagcatgga tcagttcagc acggctgtcg 900 
tcatcgtgtg cctgctggcc attggcagct catttgccgc aggtattcgg ggcggcattt 960 
ttaccctcat atttgccaga ctgaacattc gccttcgaaa ctgtctcttc cgctcactgg 1020 
tgtcccagga gacaagcttc tttgatgaga accgcacagg ggacctcatc tcccgcctga 1080 
cctcggacac caccatggtc agcgacctgg tctcccagaa catcaatgtc ttcctgcgga 1140 
acacagtcaa ggtcacgggc gtggtggtct tcatgttcag cctctcatgg cagctctcct 1200 
tggtcacctt catgggcttc cccatcatca tgatggtgtc caacatctac ggcaagtact 1260 
acaagaggct ctccaaagag gtccagaatg ccctggccag agcgagcaac acggcggagg 1320 
agaccatcag tgccatgaag actgtccgga gcttcgccaa tgaggaggag gaggcagagg 1380 
tgtacctgcg gaagctgcag caggtgtaca agctgaacag gaaggaggca gctgcctaca 1440 
tgtactacgt ctggggcagc gggctcacac tgctggtggt ccaggtcagc atcctctact 1500 
acgggggcca ccttgtcatc tcaggccaga tgaccagcgg caacctcatc gccttcatca 1560 
tctacgagtt tgtcctggga gattgtatgg agtccgtggg ctccgtctac agtggcctga 1620 
tgcagggagt gggggctgct gagaaggtgt tcgagttcat cgaccggcag ccgaccatgg 1680 
tgcacgatgg cagcttggcc cccgaccacc tggagggccg ggtggacttt gagaatgtga 1740 
ccttcaccta ccgcactcgg ccccacaccc aggtcctgca gaatgtctcc ttcagcctgt 1800 
cccccggcaa ggtgacggcc ctggtggggc cctcgggcag tgggaagagc tcctgtgtca 1860 
acatcctgga gaacttctac cccctggagg ggggccgggt gctgctggac ggcaagccca 1920 
tcagcgccta cgaccacaag tacttgcacc gtgtgatctc cctggtgagc caggagcccg 1980 
tgctgttcgc ccgctccatc acggataaca tctcctacgg cctgcccact gtgcctttcg 2040 
agatggtggt ggaggccgca cagaaggcca atgcccacgg cttcatcatg gaactccagg 2100 
acggctacag cacagagaca ggggagaagg gcgcccagct gtcaggtggc cagaagcagc 2160 
gggtggccat ggcccgggct ctggtgcgga accccccagt cctcatcctg gatgaagcca 2220 
ccagcgcttt ggatgccgag agcgagtatc tgatccagca ggccatccat ggcaacctgc 2280 
agaagcacac ggtactcatc atcgcgcacc ggctgagcac cgtggagcac gcgcacctca 2340 
ttgtggtgct ggacaagggc cgcgtagtgc agcagggcac ccaccagcag ctgctggccc 2400 
agggcggcct ctacgccaag ctggtgcagc ggcagatgct ggggcttcag cccgccgcag 2460 
acttcacagc tggccacaac gagcctgtag ccaacggcag tcacaaggcc tgatgggggg 2520 
cccctgcttc tcccggtggg gcagaggacc cggtgcctgc ctggcagatg tgcccacgga 2580 
ggcccccagc tgccctccga gcccaggcct gcagcactga aagacgacct gccatgtccc 2640 
atggatcacc gcttcctgca tcttgcccct ggtccctgcc ccattcccag ggcactcctt 2700 
acccctgctg ccctgagcca acgccttcac ggacctccct agcctcctaa gcaaaggtag 2760 
agctgccttt ttaaacctag gtcttaccag ggtttttact gtttggtttg aggcacccca 2820 
gtcaactcct agatttcaaa aacctttttc taattgggag taatggcggg cactttcacc 2880 
aagatgttct agaaacttct gagccaggag tgaatgg 2917 

<210> 45 

<211> 1474 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 656293CB1 

<400> 45 

atggggctcc ggagccacca cctcagcctg ggccttctgc ttctgtttct actccctgca 60 
gagtgcctgg gagctgaggg ccggctggct ctcaagctgt tccgtgacct ctttgccaac 120 
tacacaagtg ccctgagacc tgtggcagac acagaccaga ctctgaatgt gaccctggag 180 
gtgacactgt cccagatcat cgacatggat gaacggaacc aggtgctgac cctgtatctg 240 
tggatacggc aggagtggac agatgcctac ctacgatggg accccaatgc ctatggtggc 300 
ctggatgcca tccgcatccc cagcagtctt gtgtggcggc cagacatcgt actctataac 360 
aaagccgacg cgcagcctcc aggttccgcc agcaccaacg tggtcctgcg ccacgatggc 420 
gccgtgcgct gggacgcgcc ggccatcacg cgcagctcgt gccgcgtgga tgtagcagcc 480 
ttcccgttcg acgcccagca ctgcggcctg acgttcggct cctggactca cggcgggcac 540 
caactggatg tgcggccgcg cggcgctgca gccagcctgg cggacttcgt ggagaacgtg 600 
gagtggcgcg tgctgggcat gccggcgcgg cggcgcgtgc tcacctacgg ctgctgctcc 660 
gagccctacc ccgacgtcac cttcacgctg ctgctgcgcc gccgcgccgc cgcctacgtg 720 
tgcaacctgc tgctgccctg cgtgctcatc tcgctgcttg cgccgctcgc cttccacctg 780 
cctgccgact caggcgagaa ggtgtcgctg ggcgtcaccg tgctgctggc gctcaccgtc 840 
ttccagttgc tgctggccga gagcatgcca ccggccgaga gcgtgccgct catcgggaag 900 
tactacatgg ccactatgac catggtcaca ttctcaacag cactcaccat ccttatcatg 960 
aacctgcatt actgtggtcc cagtgtccgc ccagtgccag cctgggctag ggccctcctg 1020 
ctgggacacc tggcacgggg cctgtgcgtg cgggaaagag gggagccctg tgggcagtcc 1080 
aggccacctg agttatctcc tagcccccag tcgcctgaag gaggggctgg ccccccagcg 1140 
ggcccttgcc acgagccacg atgtctgtgc cgccaggaag ccctactgca ccacgtagcc 1200 
accattgcca ataccttccg cagccaccga gctgcccagc gctgccatga ggactggaag 1260 
cgcctggccc gtgtgatgga ccgcttcttc ctggccatct tcttctccat ggccctggtc 1320 
atgagcctcc tggtgctggt gcaggccctg tgagggctgg gactaagtca cagggatctg 1380 
ctgcagccac agctcctcca gaaagggaca gccacggcca agtggttgct ggtctttggg 1440 
ccagccagtc tctccccact gctcctaaga tcct 1474 

<210> 46 
<211> 1742 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7473957CB1 

<400> 46 

tgggctcggc tccctgcctc cgcgtcgcag cccccgccgt agccgcctcc gagcccgccg 60 
ccacatcctc tgagcagaag atggctgtgc cacccacgta tgccgatctt ggcaaatctg 120 
ccagggatgt cttcaccaag ggctatggaa tttacaagct caggctcagc caacactgag 180 
accaccaaag tgacgggcag tctggaaacc aagtacagat ggactgagta cggcctgacg 240 
tttacagaga aatggaatac cgacaataca ctaggcaccg agattactgt ggaagatcag 300 
cttgcacgtg gactgaagct gaccttcgat tcatccttct cacctaacac tgggaaaaaa 360 
aatgctaaaa tcaagacagg gtacaagcgg gagcacatta acctgggctg cgacatggat 420 
ttcgacattg ctgggccttc catccggggt gctctggtgc taggttacga gggctggctg 480 
gccggctacc agatgaattt tgagactgca aaatcccgag tgacccagag caactttgca 540 
gttggctaca agactgatga attccagctt cacactaatg tgaatgacgg gacagagttt 600 
ggcggctcca tttaccagaa agtgaacaag aagttggaga ccgctgtcaa tcttgcctgg 660 
acagcaggaa acagtaacac gcgcttcgga atagcagcca agtatcagat tgaccctgac 720 
gcctgcttct cggctaaagt gaacaactcc agcctgatag gtttaggata cactcagact 780 
ctaaagccag gtattaaact gacactgtca gctcttctgg atggcaagaa cgtcaatgct 840 
ggtggccaca agcttggtct aggactggaa tttcaagcat aaatgaatac tgtacaattg 900 
tttaatttta aactattttg cagcatagct accttcagaa tttagtgtat cttttaatgt 960 
tgtatgtctg ggatgcaagt attgctaaat atgttagccc tccaggttaa agttgattca 1020 
gctttaagat gttacccttc cagaggtaca gaagaaacct atttccaaaa aaggtccttt 1080 
cagtggtaga ctcggggaga acttggtggc ccctttgaga tgccaggttt cttttttatc 1140 
tagaaatggc tgcaagtgga agcggataat atgtaggcac tttgtaaatt catattgagt 1200 
aaatgaatga aattgtgatt tcctgagaat cgaaccttgg ttccctaacc ctaattgatg 1260 
agaggctcgc tgcttgatgg tgtgtacaaa ctcacctgaa tgggactttt ttagacagat 1320 
cttcatgacc tgttcccacc ccagttcatc atcatctctt ttacaccaaa aggtctgcag 1380 
ggtgtggtaa ctgtttcttt tgtgccattt tggggtggag aaggtggatg tgatgaagcc 1440 
aataattcag gacttattcc ttcttgtgtt gtgttttttt ttggcccttg caccagagta 1500 
tgaaatagct tccaggagct ccagctataa gcttggaagt gtctgtgtga ttgtaatcac 1560 
atggtgacaa cactcagaat ctaaattgga cttctgttgt attctcacca ctcaatttgt 1620 
tttttagcag ttaatgggta cattttagag tctccatttt gttggaatta gatcctccct 1680 
tcaagggctg gattacacac ttaaaaactg attaatattg acttaaaaaa aagggccgca 1740 
ta 1742 

<210> 47 
<211> 2312 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> misc_feature 
<223> Incyte ID No: 7474111CB1 



<400> 47 

ccaagagaaa ggctgttttt gccggtgaca aggcgttcct ccccagactt tcatcccact 60 
ctaagggcag catctcgaga gggtccaggc gcgctacacg ttaggcgcct tctcaggact 120 
cgcgccccag aacgtggggg ccggggccgg gtcggggacc gcttccctcc gcgggctggc 180 
aggcggcacc gaggctgggg gatggggcgt gtagccgccc tctgagcacg cggtacgtgg 240 
gtcgccccgc gcggcggtag gaagagcaaa gtcggcagga aaagccgtgg ctgggatgcc 300 
ttccctgaga aatccttagg ggcgatgtca gaacccgagc tgggctccgg acagtttctg 360 
gaaaaagctc tccaaacgcc gtctgtcccc gcacccgagt ccacactggg ctttgaacca 420 
gggctgttaa aaggggccct ggggactgcc caattcatcc cgatggccca gggcaggacg 480 
cgggagcagg catcccggcg ctgggctccc cgctcccccg ccctgcggac ccctccccgc 540 
cactacgggc cggagcggag ggggaggacg gcgtcacgag gcggggagcc cgaggtccag 600 
ggcggggcgc ccgggaatcc cagcccgagc aagccgggga gccctcaggg ggtcggcccc 660 
gcggcttggg agagggcacc gcggcctcgg tgtgcgcagc cctcgggcgc gagggtcggc 720 
gagcggacac agccgcgttc ccagccggtg gggctcagcc gtggcgccgg cgaggactcc 780 
ccggccaccc gcagcggggc ggcctcggtg gtgctgaacg tgggcggcgc ccggtattcg 840 
ctgtcccggg agctgctgaa ggacttcccg ctgcgccgcg tgagccggct gcacggctgc 900 
cgctccgagc gcgacgtgct cgaggtgtgc gacgactacg accgcgagcg caacgagtac 960 
ttcttcgacc ggcactcgga ggccttcggc ttcatcctgc tctacgcggc tccctccagg 1020 
cgctggctgg agcgcatgcg gcggaccttc gaggagccca cgtcgtcgct ggccgcgcag 1080 
atcctggcta gcgtgtcggt ggtgttcgtg atcgtgtcca tggtggtgct gtgcgccagc 1140 
acgttgcccg actggcgcaa cgcagccgcc gacaaccgca gcctggatga ccggagcagg 1200 
ataattgaag ctatctgcat aggttggttc actgccgagt gcatcgtgag gttcattgtc 1260 
tccaaaaaca agtgtgagtt tgtcaagaga cccctgaaca tcattgattt actggcaatc 1320 
acgccgtatt acatctctgt gttgatgaca gtgtttacag gcgagaactc tcaactccag 1380 
agggctggag tcaccttgag ggtacttaga atgatgagga ttttttgggt gattaagctt 1440 
gcccgtcact tcattggtct tcagacactc ggtttgactc tcaaacgttg ctaccgagag 1500 
atggttatgt tacttgtctt catttgtgtt gccatggcaa tctttagtgc actttctcag 1560 
cttcttgaac atgggctgga cctggaaaca tccaacaagg actttaccag cattcctgct 1620 
gcctgctggt gggtgattat ctctatgact acagttggct atggagatat gtatcctatc 1680 
acagtgcctg gaagaattct tggaggagtt tgtgttgtca gtggaattgt tctattggca 1740 
ttacctatca cttttatcta ccatagcttt gtgcagtgtt atcatgagct caagtttaga 1800 
tctgctaggt atagtaggag cctctccact gaattcctga attaatgcat tgcaaatcaa 1860 
ttcttgcata cacttcatag aaagactttg atgctgcttc atatttatgt gtttcttgct 1920 
gggtgagcac tgcagtggca ttgtcatcat cttggtaggg taaaaattat ccttcccagc 1980 
cgaagggata aaacagttta cttgttatgg agtaaataga attgagactg caaaggaaga 2040 
ataatgactc ctagagtaaa ctttaggacc cggttttatt tagacttgtt ttcccgtttc 2100 
cttgaatgat tacacatttt taaaaaatac attatttgaa cattttaaaa cagaaaggta 2160 
ctattttcca aatgtttttc catcttatga attcagaaga agcttggaac ttatagtgtt 2220 
ttttgtttga gagtaacatt ttcatttcta aatgttttat aatttctcat atcaatgtca 2280 
gaagtatcct ggaaacatat gtcacatgcg ag 2312 



<210> 48 

<211> 2320 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7480826CB1 



<400> 48 

cccgccgctc gcagggctgc tccacagccg cgcgacgccg ccgccttaga acgcctttcc 60
agtactgcta gcagcagccc gaccacgcgt taccgcacgc tcgcgccttt cccttgacac 120
ggcggacgcc ggaggattgg ggcggcaatt tgtcttttcc ttttttatta aaattatttt 180
tcctgcctgt tgttggattt ggggaaattt tttgtttgtt ttttatgatt tgtatttgac 240
tgagagaaac ccactgaaga cgtctgcgtg agaatagaga ccaccgaggc cgactcgcgg 300
gccgctgcac ccaccgccaa ggacaaaagg agcccagcgc tactagctgc acccgattcc 360
tcccagtgct tagcatgaag aaggccgaaa tgggacgatt cagtatttcc ccggatgaag 420
acagcagcag ctacagttcc aacagcgact tcaactactc ctaccccacc aagcaagctg 480
ctctgaaaag ccattatgca gatgtagatc ctgaaaacca gaacttttta cttgaatcga 540
atttggggaa gaagaagtat gaaacagaat ttcatccagg tactacttcc tttggaatgt 600
cagtatttaa tctgagcaat gcgattgtgg gcagtggaat ccttgggctt tcttatgcca 660
tggctaatac tggaattgct ctttttataa ttctcttgac atttgtgtca atattttccc 720
tgtattctgt tcatctcctt ttgaagactg ccaatgaagg agggtcttta ttatatgaac 780 
aattgggata taaggcattt ggattagttg gaaagcttgc agcatctgga tcaattacaa 840 
tgcagaacat tggagctatg tcaagctacc tcttcatagt gaaatatgag ttgcctttgg 900 
tgatccaggc attaacgaac attgaagata aaactggatt gtggtatctg aacgggaact 960 
atttggttct gttggtgtca ttggtggtca ttcttccttt gtcgctgttt agaaatttag 1020 
gatatttggg atataccagt ggcctttcct tgttgtgtat ggtgttcttt ctgattgtgg 1080 
tcatttgcaa gaaatttcag gttccgtgtc ctgtggaagc tgctttgata attaacgaaa 1140
caataaacac caccttaaca cagccaacag ctcttgtacc tgctttgtca cataacgtga 1200 
ctgaaaatga ctcttgcaga cctcactatt ttattttcaa ctcacagact gtctatgctg 1260
tgccaattct gatcttttca tttgtctgtc atcctgctgt tcttcccatc tatgaagaac 1320
tgaaagaccg cagccgtaga agaatgatga atgtgtccaa gatttcattt tttgctatgt 1380
ttctcatgta tctgcttgcc gccctctttg gatacctaac attttacgaa catgttgagt 1440 
cagaattgct tcatacctac tcttctatct tgggaactga tattcttctt ctcattgtcc 1500
gtctggctgt gttaatggct gtgaccctga cagtaccagt agttattttc ccaatccgga 1560 
gttctgtaac tcacttgttg tgtgcatcaa aagatttcag ttggtggcgt catagtctca 1620 
ttacagtgtc tatcttggca tttaccaatt tacttgtcat ctttgtccca actattaggg 1680 
atatctttgg ttttattggt gcatctgcag cttctatgtt gatttttatt cttccttctg 1740
ccttctatat caagttggtg aagaaagaac ctatgaaatc tgtacaaaag attggggctt 1800 
tgttcttcct gttaagtggt gtactggtga tgaccggaag catggccttg attgttttgg 1860 
attgggtaca caatgcacct ggaggtggcc attaattggc accactcaaa ctcaaactca 1920 
gtccatctga tgccagtgtt gagtaaactc aactactatg aaatttcacc taatgttttc 1980 
agtttcactt ccttttgaag tgcagattcc tcgctggttc ttctgagtgc agaataagtg 2040
aacttttttg ttttgttttg tttttttaag aaacttatct gtatgttaga aatggatatg 2100 
aacaacaaaa ccacgagtct cgggttaagg gaagtgacaa ttttattcca ttccagagaa 2160
tggacaaact cttaactttt atcaagccac atgcttggct gtgtcattgt ttaacttgga 2220 
tattttatga ttttacttga atgtgcctaa tggaaccatt tgatgtgaga aacaattctt 2280 
tttaatttac agcaaaatat tgaataacca ttgacaaaaa 2320 

<210> 49 
<211> 1781 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6025572CB1 

<400> 49 

gtcactttct cgccagtacg atgctgcagc ggttttccgg ttttccgctt cccttcatcg 60
tagctcccgt actcattttt agccactgct gccggttttt atatccttct ccatcatgca 120 
tcgtgagcct gcgaaaaaga aggcagaaaa gcggctgttt gacgcctcat ccttcgggaa 180
ggaccttctg gccggcggag tcgcggcagc tgtgtccaag acagcggtgg cgcccatcga 240
gcgggtgaag ctgctgctgc aggtgcaggc gtcgtcgaag cagatcagcc ccgaggcgcg 300
gtacaaaggc atggtggact gcctggtgcg gattcctcgc gagcagggtt tcttcagttt 360 
ttggcgtggc aatttggcaa atgttattcg gtattttcca acacaagctc taaactttgc 420 
ttttaaggac aaatacaagc agctattcat gtctggagtt aataaagaaa aacagttctg 480 
gaggtggttt ttggcaaacc tggcttctgg tggagctgct ggggcaacat ccttatgtgt 540
agtatatcct ctagattttg cccgaacccg attaggtgtc gatattggaa aaggtcctga 600 
ggagcgacaa ttcaagggtt taggtgactg tattatgaaa atagcaaaat cagatggaat 660 
tgctggttta taccaagggt ttggtgtttc agtacagggc atcattgtgt accgagcctc 720 
ttattttgga gcttatgaca cagttaaggg tttattacca aagccaaaga aaactccatt 780
tcttgtctcc tttttcattg ctcaagttgt gactacatgc tctggaatac tttcttatcc 840
ctttgacaca gttagaagac gtatgatgat gcagagtggt gaggctaaac ggcaatataa 900 
aggaacctta gactgctttg tgaagatata ccaacatgaa ggaatcagtt ccttttttcg 960
tggcgccttc tccaatgttc ttcgcggtac agggggtgct ttggtgttgg tattatatga 1020
taaaattaaa gaattctttc atattgatat tggtggtagg taatcgggag agtaaattaa 1080
gaaatacatg gatttaactt gttaaacata caaattacat agctgccatt tgcatacatt 1140
ttgatagtgt tattgtctgt attttgttaa agtgctagtt ctgcaataaa gcatacattt 1200 
tttcaagaat ttaaatacta aaaatcagat aaatgtggat tttcctccca cttagactca 1260 
aacacatttt agtgtgatat ttcatttatt ataggtagta tattttaatt tgttagttta 1320
aaattctttt tatgattaaa aattaatcat ataatcctag attaatgctg aaatctagga 1380
aatgaaagta gcgtctttta aattgctatt catttaatat acctgttttc ccatcttttg 1440
aagtcatatg gtatgacata tttcttaaaa gcttatcaat agatgtcatc atatgtgtag 1500
gcagaaataa gctttgttct atatctcttc taagacagtt gttattactg tgtataatat 1560
ttacagtatc agcctttgat tatagatgtg atcatttaaa atttgataat gactttagtg 1620
acattataaa actgaaactg gaaaataaaa tggcttatct gctgatgttt atctttaaaa 1680
taaataaaat cttgctagtg tgaatatatc ttagaacaaa aggtatcctc ttgaaaatta 1740
gtttgtatat tttgttgaca ataaaggaag cttaactgtt a 1781

<210> 50

<211> 2433

<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature 

<223> Incyte ID No: 5686561CB1 



<400> 50 

catccgctca caatgccaca tcaatgatac gagcacgtag cctcactgct tgcacagtgc 60
atggcagagt cggctgcgag caggcgaggt ggcctgaggg aggtcactag gctggctgag 120
ggctttttgc tgtggttctg agccggcctg cttccaggca ccgtgtccat gcgggtaagc 180
ggtctccctg ggtgcccact cttgcgcccg gagatcctga gtttggtcct gtctggccat 240
gaagctcagc ctgctgggag gccacaggga gatgcaggct gggcggcggg tggatggttc 300
cagccggttg ggtccggggc ctggagctca gcctgtgggg tggggaccca gtggtgccct 360
ggagctgccg cttctgctct cagcaggatg atgggcagga cagggagagg ctgacctact 420
tccagaacct gcctgagtct ctgacttccc tcctggtgct gctgaccacg gccaacaacc 480
ccgatgtgat gattcctgcg tattccaaga accgggccta tgccatcttc ttcatagtct 540
tcactgtgat aggaagcctg tttctgatga acctgctgac agccatcatc tacagtcagt 600
tccggggcta cctgatgaaa tctctccaga cctcgctgtt tcggaggcgg ctgggaaccc 660
gggctgcctt tgaagtccta tcctccatgg tgggggaggg aggagccttc cctcaggcag 720
ttggggtgaa gccccagaac ttgctgcagg tgcttcagaa ggtccagctg gacagctccc 780
acaaacaggc catgatggag aaggtgcgtt cctacgacag tgttctgctg tcagctgagg 840
agtttcagaa gctcttcaac gagcttgaca gaagtgtggt taaagagcac ccgccgaggc 900
ccgagtacca gtctccgttt ctgcagagcg cccagttcct cttcggccac tactactttg 960
actacctggg gaacctcatc gccctggcaa acctggtgtc catttgcgtg ttcctggtgc 1020
tggatgcaga tgtgctgcct gctgagcgtg atgacttcat cctggggatt ctcaactgcg 1080
tcttcattgt gtactacctg ttggagatgc tgctcaaggt ctttgccctg ggcctgcgag 1140
ggtacctgtc ctaccccagc aacgtgtttg acgggctcct caccgttgtc ctgctggttt 1200
tggagatctc aactctggct gtgtaccgat tgccacaccc aggctggagg ccggagatgg 1260
tgggcctgct gtcgctgtgg gacatgaccc gcatgctgaa catgctcatc gtgttccgct 1320
tcctgcgtat catccccagc atgaagccga tggccgtggt ggccagtacc gtcctgggcc 1380
tggtgcagaa catgcgtgct tttggcggga tcctggtggt ggtctactac gtatttgcca 1440
tcattgggat caacttgttt agaggcgtca ttgtggctct tcctggaaac agcagcctgg 1500
cccctgccaa tggctcggcg ccctgtggga gcttcgagca gctggagtac tgggccaaca 1560
acttcgatga ctttgcggct gccctggtca ctctgtggaa cttgatggtg gtgaacaact 1620
ggcaggtgtt tctggatgca tatcggcgct actcaggccc gtggtccaag atctattttg 1680
tgttgtggtg gctggtgtcg tctgtcatct gggtcaacct gtttctggcc ctgattctgg 1740
agaacttcct tcacaagtgg gacccccgca gccacctgca gccccttgct gggaccccag 1800
aggccaccta ccagatgact gtggagctcc tgttcaggga tattctggag gagcccgggg 1860
aggatgagct cacagagagg ctgagccagc acccgcacct gtggctgtgc aggtgacgtc 1920
cgggctgccg tcccagcagg ggcggcagga gagagaggct ggcctacaca ggtgcccatc 1980
atggaagagg cggccatgct gtggccagcc aggcaggaag agacctttcc tctgacggac 2040
cactaagctg gggacaggaa ccaagtcctt tgcgtgtggc ccaacaacca tctacagaac 2100
agctgctggt gcttcaggga ggcgccgtgc cctccgcttt cttttatagc tgcttcagtg 2160
agaattccct cgtcgactcc acagggacct ttcagacaaa aatgcaagaa gcagcggcct 2220
cccctgtccc ctgcagcttc cgtggtgcct ttgctgccgg cagcccttgg ggaccacagg 2280
cctgaccagg gcctgcacag gttaaccgtc agacttccgg ggcattcagg tggggatgct 2340
ggtggtttga catggagaga accttgactg tgttttatta tttcatggct tgtatgagtg 2400
tgactgggtg tgtttcttta gggttctgat tgc 2433



<210> 51 
<211> 1772

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1553725CB1 



<400> 51 

cccacgcgtc cgaactggtg gcatttgtcc cgggaccagg tccacagttt tatgtgtgag 60
caagatggag gctgacctgt ctggctttaa catcgatgcc ccccgttggg accagcgcac 120
cttcctgggg agagtgaagc acttcctaaa catcacggac ccccgcactg tctttgtatc 180
tgagcgggag ctggactggg ccaaggtgat ggtggagaag agcaggatgg gggttgtgcc 240
cccaggcacc caagtggagc agctgctgta tgccaagaag ctgtatgact cggccttcca 300
ccccgacact ggggagaaga tgaatgtcat cgggcgcatg tctttccagc ttcctggcgg 360
catgatcatc acgggcttca tgctccagtt ctacaggacg atgccggcgg tgatcttctg 420
gcagtgggtg aaccagtcct tcaatgcctt agtcaactac accaacagga atgcggcttc 480
ccccacatca gtcaggcaga tggccctttc ctacttcaca gccacaacca ctgctgtggc 540
cacggctgtg ggcatgaaca tgttgacaaa gaaagcgccg cccttggtgg gccgctgggt 600
gccctttgcc gctgtggctg cggctaactg tgtcaatatc cccatgatgc gacagcagga 660
gctcataaag ggaatctgcg tgaaggacag gaatgaaaat gagattggtc attcccggag 720
agctgcggcc ataggcatca cccaagtagt tatttctcgg atcaccatgt cagctcctgg 780
gatgatcttg ctgccagtca tcatggaaag gcttgagaaa ttgcacttca tgcagaaagt 840
caaggtcctg cacgccccat tgcaggtcat gctgagcggg tgcttcctca tcttcatggt 900
gccagtggcg tgtgggcttt tcccacagaa atgtgaattg ccagtttcct atctggaacc 960
gaagctccaa gacactatca aggccaagta tggagaactt gagccttatg tctacttcaa 1020
taagggtctc taaatgcccc acttcagcaa ggaccagtct attcccatat tcaccagctc 1080
ctccttagct acgtgcacac ttgtgtcctc cttccccttt gccaacaagg cctgaaggcc 1140
agggtagatt ggggggtggg acaatgaatg cctcatactt acaccctggt actggttgat 1200
tggacctcag gggaaaaaag tgaaaaaggg tagcaaaggc caatgtcttc tagctgcttc 1260
ctcaacccct gtcccctgga gaccagaagc tgaggccctc tcagggagga gacatccaag 1320
caaatcattt ggaaaagtta ggaaaccttt aggattctgg ttccagccag ggttgaggaa 1380
aagaccttgg atcaaaagga agcttctata cctctttctt cttcgcttcc tcctctccca 1440
agcaatggaa acttttaccc atgtaattct agctgaactc aggaaaaaga agggggaaag 1500
gactctgtcc ccttggggct catcaccctt ccacatcctc ctcctcgttg ccccctggtc 1560
aggcagcttc tttttttttt ttcaagatgg agtcttgctc tgtcgcccag gctggaatgc 1620
agtggcgcga tctcggctca ctgcaaactc tgcctcctgg attcaagcga ttctcctgcc 1680
tcagcctctc aagtagctgg gattacaggg cacctgccac cacgcctggc taatttttgt 1740
attttagtgg agacggggtt tcaccatgct gg 1772



<210> 52 

<211> 1874 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1695770CB1 

<400> 52 

atcttttcca gctcccagct ggcaggctaa ggccctccgg agcccaaggc gagcccaagc 60
agaagccagt agggttatct gtgtcaggat catttccagg ggaatagttc tggcccctgg 120
caggtaaaga taggccagag gagaagaggc agaagaggag agaaagcagg ctcttttgcg 180
agcagcccag gttggagaaa ggctctgtac ttttggcgtt cctgcaggga tatcccctct 240
cacattggca gccaggctga gaaagggctt caagatcccc gcagaatgac aactcttgtt 300
cctgcaaccc tctccttcct tcttctctgg accctgccag ggcaggtcct cctcagggtg 360
gccttggcaa aagaggaagt caaatctgga accaaggggt cccagcccat gtccccctct 420
gatttcctag acaaacttat ggggcgaaca tctggatatg atgccaggat tcggcccaat 480
tttaaaggcc cacccgtgaa cgtgacctgc aacatcttca tcaacagttt cagctccgtc 540
accaagacca caatggacta ccgggtgaat gtcttcttgc ggcaacagtg gaatgaccca 600
cgcctgtcct accgagaata tcctgatgac tctctggacc tcgatccctc catgctggac 660
tctatctgga agccagacct cttctttgct aatgagaaag gggccaactt ccatgaggtg 720
accacggaca acaagttact gcgcatcttc aagaatggga atgtgctgta cagcatcagg 780
ctgaccctca ttttgtcctg cctgatggac ctcaagaact tccccatgga catccagacg 840
tgcacgatgc agcttgagag ctttggctac accatgaaag acctcgtgtt tgagtggctg 900
gaagatgctc ctgctgtcca agtggctgag gggctgactc tgccccagtt tatcttgcgg 960
gatgagaagg atctaggctg ttgtaccaag cactacaaca cagggaaatt cacctgcatc 1020
gaggtaaagt ttcacctgga acggcagatg ggctactatc tgattcagat gtacatcccc 1080
agcctactca tcgtcatcct gtcctgggtc tccttctgga tcaacatgga tgctgctcct 1140
gcccgtgtgg gcctgggcat caccaccgtg ctcaccatga ccacccagag ctctggctcc 1200
cgggcctctt tgcctaaggt gtcctacgtg aaggcaatcg acatctggat ggctgtgtgt 1260
ctgctctttg tgttcgctgc cttgctggag tatgctgcca taaattttgt ttctcgtcag 1320
cataaagaat tcatacgact tcgaagaagg cagaggcgcc aacgcttgga ggaagatatc 1380
atccaagaaa gtcgtttcta tttccgtggc tatggcttgg gccactgcct gcaggcaaga 1440
gatggaggtc caatggaagg ttctggcatt tatagtcccc aacctccagc ccctcttcta 1500
agggaaggag aaaccacgcg gaaactctac gtggactgag ccaagagaat tgacaccatc 1560
tcccgggctg tcttcccttt cactttcctc atcttcaata tcttctactg ggttgtctat 1620
aaagtgctac ggtcagaaga tatccaccag gctctgtgaa tagggtggga gctatagagt 1680
cctgctgctg gcctcctgct tcctcctggg tgggctttct ccctcagtta gactccatta 1740
ggggtttgga cagttccttc ctgatctccc actcagaact tcaactacca gtcccaaagc 1800
tatgtgggcc tatattgcat ggtgccaatg gtggctgtac ttataaagat ggttatctac 1860
ccttaaaaaa aaaa 1874

<210> 53
<211> 6211
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 4672222CB1

<400> 53

cggcggaggc gggcgcgggc gcgtccctgt ggccagtcac ccggaggagt tggtcgcaca 60
attatgaaag actcggcttc tgctgctagc gccggagctg agttagttct gagaaggttt 120
ccctgggcgt tccttgtccg gcggcctctg ctgccgcctc cggagacgct tcccgataga 180
tggctacagg ccgcggagga ggaggaggtg gagttgctgc ccttccggag tccgccccgt 240
gaggagaatg tcccagaaat cctggataga aagcactttg accaagaggg aatgtgtata 300
tattatacca agttccaagg accctcacag atgccttcca ggatgtcaaa tttgtcagca 360
actcgtcagg tgtttttgtg gtcgcttggt caagcaacat gcttgtttta ctgcaagtct 420
tgccatgaaa tactcagatg tgaaattggg tgaccatttt aatcaggcaa tagaagaatg 480
gtctgtggaa aagcatacag aacagagccc aacggatgct tatggagtca taaattttca 540
agggggttct cattcctaca gagctaagta tgtgaggcta tcatatgaca ccaaacctga 600
agtcattctg caacttctgc ttaaagaatg gcaaatggag ttacccaaac ttgttatctc 660
tgtacatggg ggcatgcaga aatttgagct tcacccacga atcaagcagt tgcttggaaa 720
aggtcttatt aaagctgcag ttacaactgg agcctggatt ttaactggag gagtaaacac 780
aggtgtggca aaacatgttg gagatgccct caaagaacat gcttccagat catctcgaaa 840
gatttgcact atcggaatag ctccatgggg agtgattgaa aacagaaatg atcttgttgg 900
gagagatgtg gttgctcctt atcaaacctt attgaacccc ctgagcaaat tgaatgtttt 960
gaataatctg cattcccatt tcatattggt ggatgatggc actgttggaa agtatggggc 1020
ggaagtcaga ctgagaagag aacttgaaaa aactattaat cagcaaagaa ttcatgctag 1080
gattggccag ggtgtccctg tggtggcact tatatttgag ggtgggccaa atgttatcct 1140
cacagttctt gaataccttc aggaaagccc ccctgttcca gtagttgtgt gtgaaggaac 1200
aggcagagct gcagatctgc tagcgtatat tcataaacaa acagaagaag gagggaatct 1260
tcctgatgca gcagagcccg atattatttc cactatcaaa aaaacattta actttggcca 1320
gaatgaagca cttcatttat ttcaaacact gatggagtgc atgaaaagaa aggagcttat 1380
cactgttttc catattgggt cagatgaaca tcaagatata gatgtagcaa tacttactgc 1440
actgctaaaa ggtactaatg catctgcatt tgaccagctt atccttacat tggcatggga 1500
tagagttgac attgccaaaa atcatgtatt tgtttatgga cagcagtggc tggttggatc 1560
cttggaacaa gctatgcttg atgctcttgt aatggataga gttgcatttg taaaacttct 1620
tattgaaaat ggagtaagca tgcataaatt ccttaccatt ccgagactgg aagaacttta 1680
caacactaaa caaggtccaa ctaatccaat gctgtttcat cttgttcgag acgtcaaaca 1740
gggaaatctt cctccaggat ataagatcac tctgattgat ataggacttg ttattgaata 1800
tctcatggga ggaacctaca gatgcaccta tactaggaaa cgttttcgat taatatataa 1860
tagtcttggt ggaaataatc ggaggtctgg ccgaaatacc tccagcagca ctcctcagtt 1920
gcgaaagagt catgaatctt ttggcaatag ggcagataaa aaggaaaaaa tgaggcataa 1980
ccatttcatt aagacagcac agccctaccg accaaagatt gatacagtta tggaagaagg 2040
aaagaagaaa agaaccaaag atgaaattgt agacattgat gatccagaaa ccaagcgctt 2100
tccttatcca cttaatgaac ttttaatttg ggcttgcctt atgaagaggc aggtcatggc 2160
ccgtttttta tggcaacatg gtgaagaatc aatggctaaa gcattagttg cctgtaagat 2220
ctatcgttca atggcatatg aagcaaagca gagtgacctg gtagatgata cttcagaaga 2280
actaaaacag tattccaatg attttggtca gttggccgtt gaattattag aacagtcctt 2340
cagacaagat gaaaccatgg ctatgaaatt gctcacttat gaactgaaga actggagtaa 2400
ttcaacctgc cttaagttag cagtttcttc aagacttaga ccttttgtag ctcacacctg 2460
tacacaaatg ttgttatctg atatgtggat gggaaggctg aatatgagga aaaattcctg 2520
gtacaaggtc atactaagca ttttagttcc acctgccata ttgctgttag agtataaaac 2580
taaggctgaa atgtcccata tcccacaatc tcaagatgct catcagatga caatggatga 2640
cagcgaaaac aactttcaga acataacaga agagatcccc atggaagtgt ttaaagaagt 2700
acggattttg gatagtaatg aaggaaagaa tgagatggag atacaaatga aatcaaaaaa 2760
gcttccaatt acgcgaaagt tttatgcctt ttatcatgca ccaattgtaa aattctggtt 2820
taacacgttg gcatatttag gatttctgat gctttataca tttgtggttc ttgtacaaat 2880
ggaacagtta ccttcagttc aagaatggat tgttattgct tatattttta cttatgccat 2940
tgagaaagtc cgtgagatct ttatgtctga agctgggaaa gtaaaccaga agattaaagt 3000
atggtttagt gattacttca acatcagtga tacaattgcc ataatttctt tcttcattgg 3060
atttggacta agatttggag caaaatggaa ctttgcaaat gcatatgata atcatgtttt 3120
tgtggctgga agattaattt actgtcttaa cataatattt tggtatgtgc gtttgctaga 3180
ttttctagct gtaaatcaac aggcaggacc ttatgtaatg atgattggaa aaatggtggc 3240
caatatgttc tacattgtag tgattatggc tcttgtatta cttagttttg gtgttcccag 3300
aaaggcaata ctttatcctc atgaagcacc atcttggact cttgctaaag atatagtttt 3360
tcacccatac tggatgattt ttggtgaagt ttatgcatac gaaattgatg tgtgtgcaaa 3420
tgattctgtt atccctcaaa tctgtggtcc tgggacgtgg ttgactccat ttcttcaagc 3480
agtctacctc tttgtacagt atatcattat ggttaatctt cttattgcat ttttcaacaa 3540
tgtgtattta caagtgaagg caatttccaa tattgtatgg aagtaccagc gttatcattt 3600
tattatggct tatcatgaga aaccagttct gcctcctcca cttatcattc ttagccatat 3660
agtttctctg ttttgctgca tatgtaagag aagaaagaaa gataagactt ccgatggacc 3720
aaaacttttc ttaacagaag aagatcaaaa gaaacttcat gattttgaag agcagtgtgt 3780
tgaaatgtat ttcaatgaaa aagatgacaa atttcattct gggagtgaag agagaattcg 3840
tgtcactttt gaaagagtgg aacagatgtg cattcagatt aaagaagttg gagatcgtgt 3900
caactacata aaaagatcat tacaatcatt agattctcaa attggccatt tgcaagatct 3960
ttcagccctg acggtagata cattaaaaac actcactgcc cagaaagcgt cggaagctag 4020
caaagttcat aatgaaatca cacgagaact gagcatttcc aaacacttgg ctcaaaacct 4080
tattgatgat ggtcctgtaa gaccttctgt atggaaaaag catggtgttg taaatacact 4140
tagctcctct cttcctcaag gtgatcttga aagtaataat ccttttcatt gtaatatttt 4200
aatgaaagat gacaaagatc cccagtgtaa tatatttggt caagacttac ctgcagtacc 4260
ccagagaaaa gaatttaatt ttccagaggc tggttcctct tctggtgcct tattcccaag 4320
tgctgtttcc cctccagaac tgcgacagag actacatggg gtagaactct taaaaatatt 4380
taataaaaat caaaaattag gcagttcatc tactagcata ccacatctgt catccccacc 4440
aaccaaattt tttgttagta caccatctca gccaagttgc aaaagccact tggaaactgg 4500
aaccaaagat caagaaactg tttgctctaa agctacagaa ggagataata cagaatttgg 4560
agcatttgta ggacacagag atagcatgga tttacagagg tttaaagaaa catcaaacaa 4620
gataaaaata ctatccaata acaatacttc tgaaaacact ttgaaacgag tgagttctct 4680
tgctggattt actgactgtc acagaacttc cattcctgtt cattcaaaac aagaaaaaat 4740
cagtagaagg ccatctaccg aagacactca tgaagtagat tccaaagcag ctttaatacc 4800
ggattggtta caagatagac catcaaacag agaaatgcca tctgaagaag gaacattaaa 4860
tggtctcact tctccattta agccagctat ggatacaaat tactattatt cagctgtgga 4920
aagaaataac ttgatgaggt tatcacagag cattccattt acacctgtgc ctccaagagg 4980
ggagcctgtc acagtgtatc gtttggaaga gagttcaccc aacatactaa ataacagcat 5040
gtcttcttgg tcacaactag gcctctgtgc caaaatagag tttttaagca aagaggagat 5100
gggaggaggt ttacgaagag ctgtcaaagt acagtgtacc tggtcagaac atgatatcct 5160
caaatcaggg catctttata ttatcaaatc ttttcttcca gaggtggtta atacatggtc 5220
aagtatttat aaagaagata cagttctgca tctctgtctg agagaaattc aacaacagag 5280
agcagcacaa aagcttacgt ttgcctttaa tcaaatgaaa cccaaatcca taccatattc 5340
tccaaggttc cttgaagttt tcctgctgta ttgccattca gcaggacagt ggtttgctgt 5400
ggaagaatgt atgactggag aatttagaaa atacaacaat aataatggag atgagattat 5460
tccaactaat actctggaag agatcatgct agcctttagc cactggactt acgaatatac 5520
aagaggggag ttactggtac ttgatttgca aggtgttggt gaaaatttga ctgacccatc 5580
tgtgataaaa gcagaagaaa agagatcctg tgatatggtt tttggcccag caaatctagg 5640
agaagatgca attaaaaact tcagagcaaa acatcactgt aattcttgct gtagaaagct 5700
taaacttcca gatctgaaga ggaatgatta tacgcctgat aaaattatat ttcctcagga 5760
tgagccttca gatttgaatc ttcagcctgg aaattccacc aaagaatcag aatcaactaa 5820
ttctgttcgt ctgatgttat aatattaata ttactgaatc attggttttg cctgcacctc 5880
acagaaatgt tactgtgtca cttttccctc gggaggaaat tgtttggtaa tatagaaagg 5940
tgtatgcaag ttgaatttgc tgactccagc acagttaaaa ggtcaatatt cttttgacct 6000
gattaatcag tcagaaagtc cctataggat agagctggca gctgagaaat tttaaaggta 6060
attgataatt agtatttata actttttaaa gggctctttg tatagcagag gatctcattt 6120
gactttgttt tgatgagggt gatgctctct cttatgtggt acaataccat taaccaaagg 6180
taggtgtcca tgcagatttt attggcagct g 6211

<210> 54
<211> 3714
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 6176128CB1

<400> 54

atggcgcggg ccaagctgcc gcgctcgccg tccgagggca aggcgggccc ggggggcgcc 60
ccagccggcg ccgcagcccc cgaggagcct cacgggctca gcccgctgct gccggcccgc 120
ggcgggggct ccgtgggcag cgacgtgggc cagaggcttc ctgtagaaga tttcagcctg 180
gactcctccc tgtctcaggt ccaggtggag ttctacgtca acgagaacac cttcaaggag 240
cggctcaagc tgttcttcat caaaaaccaa agatcgagtc tgaggatccg gctgttcaac 300
ttctccctga agctgctcac ctgcctgctc tacattgtgc gcgtcctgct cgatgacccg 360
gccctgggca tcggatggtg gggctgccca aggcagaact actccttcaa tgactcgtcc 420
tccgagatca actgggctcc tattctgtgg gtggagagaa agatgacact gtgggcgatc 480
caggtcatcg tggccataat aagcttcctg gagacgatgc ttctcatcta cctcagctac 540
aaaggcaaca tctgggagca gatcttccgc gtgtccttcg tcctggagat gatcaacact 600
ctgcccttca tcatcacgat cttctggccg ccgctgcgga acctgttcat ccccgtcttt 660
ctgaactgct ggctggccaa gcacgcgctg gaaaacatga ttaatgactt ccaccgtgcc 720
atcctgcgga cacagtcagc catgttcaac caggtcctca tcctcttctg caccctgctg 780
tgcctcgttt tcacggggac ctgcggcatc cagcacctgg agcgggcggg cgagaacctg 840
tccctcctga cctccttcta cttctgcatc gtcaccttct ccaccgtggg ctacggtgac 900
gtcacgccca agatctggcc atcgcagctg ctggtggtca tcatgatctg cgtggccctc 960
gtggtgctcc cactgcagtt cgaggagctc gtctacctct ggatggagcg gcagaagtca 1020
gggggcaact acagccgcca ccgtgcgcag acggagaagc acgtggtcct gtgtgtcagc 1080
tccctcaaga tcgaccttct catggacttc ctgaacgagt tctacgccca cccccggctc 1140
caggactatt acgtggtcat cctgtgcccc acggagatgg atgtccaggt gcgcagagtc 1200
ctgcagatcc ctctgtggtc ccagcgggtc atctacctcc agggctctgc actcaaagac 1260
caggacctca tgcgagccaa gatggacaat ggggaggcct gcttcatcct cagcagcagg 1320
aacgaggtgg accgcacggc tgcagaccac cagaccatcc tgcgcgcctg ggccgtgaag 1380
gacttcgccc ccaactgccc cctctacgtc cagatcctca aacctgaaaa caagtttcac 1440
gtcaagtttg ctgaccacgt ggtgtgtgag gaggagtgca agtacgccat gctggcgctg 1500
aactgcatct gcccggcgac ctccaccctc atcaccctgc tggtgcacac gtcccgcggc 1560
caggagggac aggagtctcc ggagcagtgg cagcgcatgt atgggcgctg ctccggcaac 1620
gaggtgtacc acatccgcat gggtgacagc aagttcttcc gcgagtacga gggcaagagc 1680
ttcacctacg cggccttcca cgcccacaag aagtatggcg tgtgcctcat cgggctgaag 1740
cgggaggaca acaagagcat cctgctgaac ccggggcccc ggcacatcct ggccgcctct 1800
gacacctgct tctacatcaa catcaccaag gaggagaact cggccttcat cttcaagcag 1860
gaggagaagc ggaagaagag ggccttctcg gggcaggggc tgcacgaggg tccggcccgc 1920
ctgcccgtgc acagcatcat cgcctccatg gggacagtgg ccatggacct gcagggcaca 1980
gagcaccggc ctacgcagag cggcggtggg ggcgggggca gcaagctggc actgcccacg 2040
gagaacggct cgggcagccg gcggcccagc atcgcgcccg tcctggaact ggccgacagc 2100
tcagccctgc tgccctgcga cctgctgagc gaccagtcgg aggatgaggt gacgccgtcg 2160
gacgacgagg ggctctccgt ggtagagtat gtgaagggct accctcccaa ctcgccctac 2220
atcggcagct ccccaaccct gtgccacctc ctgcctgtga aagccccctt ctgctgcctg 2280
cggctggaca agggctgcaa gcacaacagc tatgaagacg ccaaggccta cgggttcaag 2340
aacaagctga tcatcgtctc ggcagagacg gccggcaatg ggctgtacaa cttcatcgtg 2400
ccactgcggg cctactacag atcccgcaag gagctgaacc ccatcgtgct gctgctggac 2460
aacaagcccg accaccactt cctggaagcc atctgctgct tccccatggt ctactacatg 2520
gagggctctg tggacaacct ggacagcctg ctgcagtgtg gcatcatcta tgcggacaac 2580
ctggtggtgg tggacaagga gagcaccatg agcgccgagg aggactacat ggcggacgcc 2640
aagaccatcg tcaacgtgca gaccatgttc cggctcttcc ccagcctcag catcaccacg 2700
gagctcaccc acccttccaa catgcgcttc atgcagttcc gcgccaagga cagctactct 2760
ctggctcttt ccaaactaga aaagagggag cgagagaatg gctccaacct ggccttcatg 2820
ttccgcctgc cgttcgccgc cggccgcgtc ttcagcatca gcatgttgga cacactgctc 2880
taccagtcct tcgtgaagga ctacatgatc accatcaccc ggctgctgct gggcctggac 2940
accacgccgg gctcggggta cctctgtgcc atgaaaatca ccgagggcga cctgtggatc 3000
cgcacgtacg gccgcctctt ccagaagctc tgctcctcca gcgccgagat ccccattggc 3060
atctaccgga cagagagcca cgtcttctcc acctcggagc cccacgaact cagagcccag 3120
tcccagatct cggtgaacgt ggaggactgt gaggacacac gggaagtgaa ggggccctgg 3180
ggctcccgcg ctggcaccgg aggcagctcc cagggccgcc acacgggcgg cggtgacccc 3240
gcagagcacc cactgctacg gcgcaagagc ctgcagtggg cccggaggct gagccgcaag 3300
gcgcccaagc aggcaggccg ggcggcggcc gcggagtgga tcagccagca gcgcctcagc 3360
ctgtaccggc gctctgagcg ccaggagctc tccgagctgg tgaagaaccg catgaagcac 3420
ctggggctgc ccaccaccgg ctacgaggac gtagcaaatt taacagccag tgatgtcatg 3480
aatcgggtaa acctgggata tttgcaagac gagatgaacg accaccagaa caccctctcc 3540
tacgtcctca tcaaccctcc gcccgacacg aggctggagc ccagtgacat tgtgtatctc 3600
atccgctccg accccctggc tcacgtggcc agcagctccc agagccggaa gagcagctgc 3660
agccacaagc tgtcgtcctg caaccccgag actcgcgacg agacacagct ctga 3714

<210> 55

<211> 3115

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7473418CB1

<400> 55

atggcctcgg cgctgagcta tgtctccaag ttcaagtcct tcgtgatctt gttcgtcacc 60
ccgctcctgc tgctgccact cgtcattctg atgcccgcca agtttgtcag gtgtgcctac 120
gtcatcatcc tcatggccat ttactggtgc acagaagtca tccctctggc tgtcacctct 180
ctcatgcctg tcttgctttt cccactcttc cagattctgg actccaggca ggtgtgtgtc 240
cagtacatga aggacaccaa catgctgttc ctgggcggcc tcatcgtggc cgtggctgtg 300
gagcgctgga acctgcacaa gaggatcgcc ctgcgcacgc tcctctgggt gggggccaag 360
cctgcacggc tgatgctggg cttcatgggc gtcacagccc tcctgtccat gtggatcagt 420
aacacggcaa ccacggccat gatggtgccc atcgtggagg ccatattgca gcagatggaa 480
gccacaagcg cagccaccga ggccggcctg gagctggtgg acaagggcaa ggccaaggag 540
ctgccagcta acagcgctgt gcccaccaca gggagtcaag tgatttttga aggccccact 600
ctggggcagc aggaagacca agagcggaag aggttgtgta aggccatgac cctgtgcatc 660
tgctacgcgg ccagcatcgg gggcaccgcc accctgaccg ggacgggacc caacgtggtg 720
ctcctgggcc agatgaacga gttgtttcct gacagcaagg acctcgtgaa ctttgcttcc 780
tggtttgcat ttgcctttcc caacatgctg gtgatgctgc tgttcgcctg gctgtggctc 840
cagtttgttt acatgagatt caattttaaa aagtcctggg gctgcgggct agagagcaag 900
aaaaacgaga aggctgccct caaggtgctg caggaggagt accggaagct ggggcccttg 960
tccttcgcgg agatcaacgt gctgatctgc ttcttcctgc tggtcatcct gtggttctcc 1020
cgagaccccg gcttcatgcc cggctggctg actgttgcct gggtggagga aaggaaaact 1080
ccattttatc cccctcccct gctggattgg aaggtaaccc aggagaaagt gccctggggc 1140
atcgtgctgc tactaggggg cggatttgct ctggctaaag gatccgaggc ctcggggctg 1200
tccgtgtgga tggggaagca gatggagccc ttgcacgcag tgcccccggc agccatcacc 1260
ttgatcttgt ccttgctcgt tgccgtgttc actgagtgca caagcaacgt ggccaccacc 1320
accttgttcc tgcccatctt tgcctccatg tctcgctcca tcggcctcaa tccgctgtac 1380
atcatgctgc cctgtaccct gagtgcctcc tttgccttca tgttgcctgt ggccacccct 1440
ccaaatgcca tcgtgttcac ctatgggcac ctcaaggttg ctgacatggt gaaaacagga 1500
gtcataatga acataattgg agtcttctgt gtgtttttgg ctgtcaacac ctggggacgg 1560
gccatatttg acttggatca tttccctgac tgggctaatg tgacacatat tgagacttag 1620
gaagagccac aagaccacac acacagccct taccctcctc aggactaccg aaccttctgg 1680
cacaccttgt acagagtttt ggggttcaca ccccaaaatg acccaacgat gtccacacac 1740
caccaaaapc cagccaatgg gccacctctt cctccaagcc cagatgcaga gatggacatg 1800
ggcagctgga gggtaggctc agaaatgaag ggaacccctc agtgggctgc tggacccatc 1860
tttcccaagc cttgccatta tctctgtgag ggaggccagg tagccgaggg atcaggatgc 1920
aggctgctgt acccgctctg cctcaagcat cccccacaca gggctctggt tttcactcgc 1980
ttcgtcctag atagtttaaa tgggaatcgg atcccctggt tgagagctaa gacaaccacc 2040
taccagtgcc catgtccctt ccagctcacc ttgagcagcc tcagatcatc tctgtcactc 2100
tggaagggac accccagcca gggacggaat gcctggtctt gagcaacctc ccactgctgg 2160 
agtgcgagtg ggaatcagag cctcctgaag cctctgggaa ctcctcctgt ggccaccacc 2220 
aaaggatgag gaatctgagt tgccaacttc aggacgacac ctggcttgcc acccacagtg 2280 
caccacaggc caacctacgc ccttcatcac ttggttctgt tttaatcgac tggccccctg 2340 
tcccacctct ccagtgagcc tccttcaact ccttggtccc ctgttgtctg ggtcaacatt 2400 
tgccgagacg ccttggctgg caccctctgg ggtccccctt ttctcccagg caggtcatct 2460 
tttctgggag atgcttcccc tgccatcccc aaatagctag gatcacactc caagtatggg 2520 
cagtgatggc gctctggggg ccacagtggg ctatctaggc cctccctcac ctgaggccca 2580 
gagtggacac agctgttaat ttccactggc tatgccactt cagagtcttt catgccagcg 2640 
tttgagctcc tctgggtaaa atcttccctt tgttgactgg ccttcacagc catggctggt 2700 
gacaacagag gatcgttgag attgagcagc gcttggtgat ctctcagcaa acaacccctg 2760 
cccgtgggcc aatctacttg aagttactcg gacaaagacc ccaaagtggg gcaacaactc 2820 
cagagaggct gtgggaatct tcagaagccc ccctgtaaga gacagacatg agagacaagc 2880 
atcttctttc ccccgcaagt ccattttatt tccttcttgt gctgctctgg aagagaggca 2940 
gtagcaaaga gatgagctcc tggatggcat tttccagggc aggagaaagt atgagagcct 3000 
caggaaaccc catcaaggac cgagtatgtg tctggttcct tgggtgggac gattcctgac 3060 
cacactgtcc agctcttgct ctcattaaat gctctgtctc ccgcggaaaa aaaaa 3115 

<210> 56 

<211> 2846 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7474129CB1 

<400> 56 

ttccagccat ccctctgcct gcaatgagag cttcccgccg cctcagccac agtcccaccc 60 
gggggccttg ggccccagac atgcggtgat ctcagggcaa gggttgccac gaccacccag 120 
aacctcacca gccatgaaag cccaccccaa ggagatggtg cctctcatgg gcaagagagt 180
tgctgccccc agtgggaacc ctgccgtcct gccagagaag aggccggcgg agatcacccc 240 
cacaaagaag agcatctctg gtaactgtga tgacatggac tccccccagt ctcctcaaga 300 
tgatgtgaca gagaccccat ccaatcccaa cagccccagt gcacagctgg ccaaggaaga 360 
gcagaggagg aaaaagaggc ggctgaagaa gcgcatcttt gcagccgtgt ctgagggctg 420 
cgtggaggag ttggtagagt tgctggtgga gctgcaggag ctttgcaggc ggcgccatga 480 
tgaggatgtg cctgacttcc tcatgcacaa gctgacggcc tccgacacgg ggaagacctg 540 
cctgatgaag gccttgttaa acatcaaccc caacaccaag gagatcgtgc ggatcctgct 600 
tgcctttgct gaagagaacg acatcctggg caggttcatc aacgccgagt acacagagga 660 
ggcctatgaa gggcagacgg cgctgaacat cgccatcgag cggcggcagg gggacatcgc 720 
agccctgctc atcgccgccg gcgccgacgt caacgcgcac gccaaggggg ccttcttcaa 780 
ccccaagtac caacacgaag gcttctactt cggtgagacg cccctggccc tggcagcatg 840 
caccaaccag cccgagattg tgcagctgct gatggagcac gagcagacgg acatcacctc 900 
gcgggactca cgaggcaaca acatccttca cgccctggtg accgtggccg aggacttcaa 960 
gacgcagaat gacgttgtga agcgcatgta cgacatgatc ctactgcgga gtggcaactg 1020 
ggagctggag accactcgca acaacgatgg cctcacgccg ctgcagctgg ccgccaagat 1080 
gggcaaggcg gagatcctga agtacatcct cagtcgtgag atcaaggaga agcggctccg 1140 
gagcctgtcc aggaagttca ccgactgggc gtacggaccc gtgtcatcct ccctctacga 1200 
cctcaccaac gtggacacca ccacggacaa ctcagtgctg gaaatcactg tctacaacac 1260 
caacatcgac aaccggcatg agatgctgac cctggagccg ctgcacacgc tgctgcatat 1320 
gaagtggaag aagtttgcca agcacatgtt ctttctgtcc ttctgctttt atttcttcta 1380 
caacatcacc ctgaccctcg tctcgtacta ccgcccccgg gaggaggagg ccatcccgca 1440 
ccccttggcc ctgacgcaca agatggggtg gctgcagctc ctagggagga tgtttgtgct 1500 
catctgggcc atgtgcatct ctgtgaaaga gggcattgcc atcttcctgc tgagaccctc 1560 
ggatctgcag tccatcctct cggatgcctg gttccacttt gtctttttta tccaagctgt 1620 
gcttgtgata ctgtctgtct tcttgtactt gtttgcctac aaagagtacc tcgcctgcct 1680 
cgtgctggcc atggccctgg gctgggcgaa catgctctac tatacgcggg gtttccagtc 1740 
catgggcatg tacagcgtca tgatccagaa ggtcattttg catgatgttc tgaagttctt 1800 
gtttgtatat atcgtgtttt tgcttggatt tggagtagcc ttggcctcgc tgatcgagaa 1860 
gtgtcccaaa gacaacaagg actgcagctc ctacggcagc ttcagcgacg cagtgctgga 1920 
actcttcaag ctcaccatag gcctgggtga cctgaacatc cagcagaact ccaagtatcc 1980 
cattctcttt ctgttcctgc tcatcaccta tgtcatcctc acctttgttc tcctcctcaa 2040 
catgctcatt gctctgatgg gcgagactgt ggagaacgtc tccaaggaga gcgaacgcat 2100
ctggcgcctg cagagagcca ggaccatctt ggagtttgag aaaatgttac cagaatggct 2160 
gaggagcaga ttccggatgg gagagctgtg caaagtggcc gaggatgatt tccgactgtg 2220 
tttgcggatc aatgaggtga agtggactga atggaagacg cacgtctcct tccttaacga 2280 
agacccgggg cctgtaagac gaacagattt caacaaaatc caagattctt ccaggaacaa 2340 
cagcaaaacc actctcaatg catttgaaga agtcgaggaa ttcccggaaa cctcggtgta 2400 
gaagcggaac ccagagctgg tgtgcgcgtg cgctgtctgg cgctgcaggc ggagtcaccg 2460 
actctgtgca gagaggcttt gagggatgat ggagtccggc tctgctggcc tagaagcaga 2520 
gtgcaccctc gtgctcagtg ctcagtgggt gtctgaactg aggggcagtt gtcaatttgt 2580 
ctgagtggga aacatcctgg attttgttac ttggcaaaca gctggtgtaa acctacagcc 2640 
agcagcagtc tggagcctgg gagcctcctg aagtcccggg tgaagcctct ggttttacca 2700 
attgcaggtc ggcttggctg ggagagatgg atggcgggaa aggggcagca gtcttgagga 2760 
gcagggagag gagtctttcc tcctgccagc ttcccccgtc agccccaacc ccagcccaca 2820 
cattgtacca tctcttctgc tgtgac 2846 

<210> 57 

<211> 906 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7481414CB1 



<400> 57 

atgaagtctc accctgccat ccaagccgcc atagacctca ctgcgggcgc agcagggggc 60
ggagcttgtg tgctgactgg gcaacccttc gacaccataa aagtaaagat gcagacattt 120
cctcagctgt acaaaggcct tgccgactgc ttcctgaaaa catacaacca agtgggcatc 180
cgtggccttt acaggggaac cagtcctgca ctgctagcct atgtcaccca gggttctgtc 240
ctgttcatgt gctttggctt ttgccaacag tttgtcagga aagtggctag agtggagcag 300
aatgcagagc tgaacgactt ggagactgct actgctgggt cgctggcttc tgcatttgct 360
gcgctggctc tctgccccac tgagcttgtg aagtgtcggc tgcagaccat gtatgagatg 420
aagatgtcag ggaagatagc acaaagctat aacacaattt ggtctatggt taagagtatc 480
ttcatgaagg atggtccctt aggcttctat cgtggactct cgaccactct tgctcaggaa 540
atacctggct atttcttcta ctttgggggc tatgaaatca gtcgatcatt ttttgcatca 600
gggggatcaa aggatgaact aggccctgtc cctttgatgt taagtggagg ctttgctggg 660
atctgtctct ggcttatcat attcccagtg gactgcatta aatccagaat ccaggttctt 720
tctatgtttg ggaagcctgc aggattaatc gaaaccttta taagtgttgt gagaaatgaa 780
gggatatcag ccttgtattc tggattgaaa gccactctga ttcgagccat cccttccaat 840
gctgctctct ttttggttta tgagtacagc agaaagatga tgatgaacat ggtggaagaa 900
tactga 906

<210> 58

<211> 1840

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7481461CB1

<400> 58

gcggccgcct gcgcgctggc cgcctgcgcg ctgccagccc gcccgcccgc caggggctcc 60
gccgccctcg cctcggcctc gttagcccgc caggagcccc gcagctcctc cgggagcccg 120
ctggtaactc gcgtccctcg cgcttctccg gcgcctgagg ggcccgcctc gggccatggt 180
gctctcccag gaggagccgg actccgcgcg gggcacgagc gaggcgcagc cgctcggccc 240
cgcgcccacg ggggccgctc cgccgcccgg cccgggaccc tcggacagcc ccgaggcggc 300
tgtcgagaag gtggaggtgg agctggcggg gccggcgacc gcggagcccc atgagccccc 360
cgaacccccc gagggcggct ggggctggct ggtgatgctg gcggccatgt ggtgcaacgg 420
gtcggtgttc ggcatccaga acgcttgcgg ggtgctcttc gtgtccatgc tggaaacctt 480
cggctccaaa gacgatgaca agatggtctt taagacagca tgggtaggtt ctctctccat 540
ggggatgatt ttcttttgct gcccaatagt cagcgtcttc acagacctat ttggttgtcg 600
gaaaacagct gtcgtgggtg ctgctgttgg atttgttggg ctcatgtcca gttcttttgt 660
aagttccatc gagcctctgt accttaccta tggaatcata tttgcctgcg gctgctcctt 720
tgcataccag ccttcattgg tcattttggg acactatttc aagaagcgcc ttggactggt 780
gaatggcatt gtcactgctg gcagcagtgt cttcacaatc ctgctgcctt tgctcttaag 840
ggttctgatt gacagcgtgg gcctctttta cacattgagg gtgctctgca tcttcatgtt 900
tgttctcttt ctggctggct ttacttaccg acctcttgct accagtacca aagataaaga 960
gagtggaggt agcggatcct ccctcttttc caggaaaaag ttcagtcctc caaaaaaaat 1020
tttcaatttt gccatcttca aggtgacagc ttatgcagtg tgggcagttg gaataccact 1080
tgcacttttt ggatactttg tgccttatgt tcacttgatg aaacatgtaa atgaaagatt 1140
tcaagatgaa aaaaataaag aggttgttct catgtgcatt ggcgtcactt caggagttgg 1200
acgactgctc tttggccgga ttgcagatta tgtgcctggt gtgaagaagg tttatctaca 1260
ggtactctcc tttttcttca ttggtctgat gtccatgatg attcctctgt gtagcatctt 1320
tggggccctc attgctgtgt gcctcatcat gggtctcttc gatggatgct tcatttccat 1380
tatggctccc atagcctttg agttagttgg tgcccaggat gtctcccaag caattggatt 1440
tctgctcgga ttcatgtcta tacccatgac tgttggccca cccattgcag ggttacttcg 1500
tgacaaactg ggctcctatg atgtggcatt ctacctcgct ggagtccctc cccttattgg 1560
aggtgctgtg ctttgtttta tcccgtggat ccatagtaag aagcaaagag agatcagtaa 1620
aaccactgga aaagaaaaga tggagaaaat gttggaaaac cagaactctc tgctgtcaag 1680
ttcatctgga atgttcaaga aagaatctga ctctattatt taatatctta catacctcca 1740
ccagactgga cttgcttttt gaattttaag caagtttcct ttccttttat acaaattgca 1800
aatttcatat ttttttaatc acatcctagg aatagcacaa 1840

<210> 59
<211> 5348
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 7472541CB1

<400> 59

gttagcttga ggccttgcct tgacataatc agatataatc agaaaaatga gaaattccat 60
aggaaagaga actattttag ccaaggtgtg cgagagaaat accgccactt tcaagcactg 120
ttttcttcta ctggagtctg ctcaataggg acgtcagctt tgctggggct tcctttgaca 180
agagaatcag aaccgactgg tgacatttgt ttcaatgaaa gcaacagtgt gaagaaactt 240
gcaatttatt ctgatcaaca atttgcccag ctaaagtact acatctgccc cctcttcctg 300
tggctgtagg ggcacagcaa aggtcactgg tctaacctcc ttaaagggac tccgctaaca 360
gatcttcgcc tgctgctgga aatggccctc tcagtggact catcgtggca tcggtggcag 420
tggagagtca gagatggctt cccccattgt ccatcggaaa ccacaccgct gctctctcca 480
gagaaaggga gacagagcta caacttgaca cagcagcggg tcgtgttccc caacaacagc 540
atattccatc aagattggga agaggtctcc aggagatacc ctggcaacag aacctgcaca 600
accaaataca ccctcttcac cttcctgccc cggaatctct ttgagcaatt tcatagatgg 660
gctaacctct atttcctgtt cctggtgatt ttgagctgga tgccctccat ggaagtcttc 720
cacagagaaa tcaccatgtt accattggcc attgtcctgt tcgtcatcat gatcaaggat 780
ggcatggagg acttcaagag acaccgcttt gataaagcaa taaactgctc caacattcga 840
atttatgaaa gaaaagagca gacctatgtg cagaagtgct ggaaggatgt gcgtgtggga 900
gacttcatcc aaatgaaatg caatgagatt gtcccagcag acatactcct ccttttttcc 960
tctgacccca atgggatatg ccatctggaa actgccagct tggatggaga gacaaacctc 1020
aagcaaagac gtgtcgtgaa gggcttctca cagcaggagg tacagttcga accagagctt 1080
ttccacaata ccatcgtgtg tgagaaaccc aacaaccacc tcaacaaatt taagggttat 1140
atggagcatc ctgaccagac caggactggc tttggctgtg agagtcttct gcttcgaggc 1200
tgcaccatca gaaacaccga gatggctgtt ggcattgtca tctatgcagg ccatgagacg 1260
aaagccatgc tgaacaacag tggcccccgg tacaaacgca gcaagattga gcggcgcatg 1320
aatatagaca tcttcttctg cattgggatc ctcatcctca tgtgccttat tggagctgta 1380
ggtcacagca tctggaatgg gacctttgaa gaacaccctc ccttcgatgt gccagatgcc 1440
aatggcagct tccttcccag tgcccttggg ggcttctaca tgttcctcac aatgatcatc 1500
ctgctccagg tgctgatccc catctctttg tatgtctcca ttgagctggt gaagctcggg 1560
caagtgttct tcttgagcaa tgaccttgac ctgtatgatg aagagaccga tttatccatt 1620
caatgtcgag ccctcaacat cgcagaggac ttgggccaga tccagtacat cttctccgat 1680
aagacgggga ccctgacaga gaacaagatg gtgttccgac gttgcaccat catgggcagc 1740
gagtattctc accaagaaaa tgctaagcga ctggagaccc caaaggagct ggactcagat 1800
ggtgaagagt ggacccaata ccaatgcctg tccttctcgg ctagatgggc ccaggatcca 1860
gcaactatga gaagccaaaa aggtgctcag cctctgagga ggagccagag tgcccgggtg 1920
cccatccagg gccactaccg gcaaaggtct atggggcacc gtgaaagctc acagcctcct 1980
gtggccttca gcagctccat agaaaaagat gtaactccag ataaaaacct actgaccaag 2040
gttcgagatg ctgccctgtg gttggagacc ttgtcagaca gcagacctgc caaggcttcc 2100
ctctccacca cctcctccat tgctgatttc ttccttgcct taaccatctg caactctgtc 2160
atggtgtcca caaccaccga gcccaggcag aggtgggatg atcaaaagat agtggaaaat 2220
gaccattgtc aatgcttaga atttcagggc tggaggaaaa tatctggctt cacttattgc 2280
aaaagtacct tcatattccg cataagacaa cttggtatta tttccaacat tgagagtaat 2340
attccacttt ccttctttgg ccacaaggtc accatcaaac cctcaagcaa ggctctgggg 2400
acgtccctgg agaagattca gcagctcttc cagaagttga agctattgag cctcagccag 2460
tcattctcat ccactgcacc ctctgacaca gacctcgggg agagcttagg ggccaacgtg 2520
gccaccacag actcggatga gagagatgat gcatctgtgt gcagtggagg tgactccact 2580
gatgacggtg gctacaggag cagcatgtgg gaccagggcg acatcctgga gtctgggtca 2640
ggcacttcct tggaggaggc attggaggcc ccagccacag acctggccag gcctgagttc 2700
tgttacgagg ctgagagccc tgatgaggcc gccctggtgc acgctgccca tgcctacagc 2760
ttcacactag tgtcccggac acctgagcag gtgactgtgc gcctgcccca gggcacctgc 2820
ctcaccttca gcctcctctg caccctgggc tttgactctg tcaggaagag aatgtctgtg 2880
gttgtgaggc acccactgac tggcgagatt gttgtctaca ccaagggtgc tgactcggtc 2940
atcatggacc tgctggaaga cccagcctgc gtacctgaca ttaatatgga aaagaagctg 3000
agaaaaatcc gagcccggac ccaaaagcat ctagacttgt atgcaagaga tggcctgcgc 3060
acactatgca ttgccaagaa ggttgtaagc gaagaggact tccggagatg ggccagtttc 3120
cggcgtgagg ctgaggcatc cctcgacaac cgagatgagc ttctcatgga aactgcacag 3180
catctggaga atcaactcac cttacttgga gccactggga tcgaagaccg gctgcaggaa 3240
ggagttccag atacgattgc cactctgcgg gaggctggga tccagctctg ggtcctgact 3300
ggagataagc aggagacagc ggtcaacatt gcccattcct gcagactgtt aaatcagacc 3360
gacactgttt ataccatcaa tacagagaat caggagacct gtgaatccat cctcaattgt 3420
gcattggaag agctaaagca atttcgtgaa ctacagaagc cagaccgcaa gctctttgga 3480
ttccgcttac cttccaagac accatccatc acctcagaag ctgtggttcc agaagctgga 3540
ttggtcatcg atgggaagac attgaatgcc atcttccagg gaaagctaga gaagaagttt 3600
ctggaattga cccagtattg tcggtccgtc ctgtgctgcc gctccacgcc actccagaag 3660
agtatgatag tcaagctggt gcgagacaag ttgcgcgtca tgaccctttc cataggtgat 3720
ggagcaaatg atgtaagcat gattcaagct gctgatattg gaattggaat atctggacag 3780
gaaggcatgc aggctgtcat gtccagcgac tttgccatca cccgctttaa gcatctcaag 3840
aagttgctgc tcgtgcatgg ccactggtgt tactcgcgcc tggccaggat ggtggtgtac 3900
tacctctaca agaacgtgtg ctacgtcaac ctgctcttct ggtatcagtt cttctgtggt 3960
ttctccagct ccaccatgat tgattactgg cagatgatat tcttcaatct cttctttacc 4020
tccttgcctc ctcttgtctt tggagtcctt gacaaagaca tctctgcaga aacactcctg 4080
gcattgcctg agctatacaa gagtggccag aactctgagt gctataacct gtcgactttc 4140
tggatttcta tggtggatgc attctaccag agcctcatct gtttctttat cccttacctg 4200
gcctataagg gctctgatat agatgtcttt acctttggga caccaatcaa caccatctcc 4260
ctcaccacaa tccttttgca ccaggcaatg gaaatgaaga catggaccat tttccacgga 4320
gtcgtgctcc tcggcagctt cctgatgtac tttctggtat ccctcctgta caatgccacc 4380
tgcgtcatct gcaacagccc caccaatccc tattgggtga tggaaggcca gctctcaaac 4440
cccactttct acctcgtctg ctttctcaca ccagttgttg ctcttctccc aagatacttt 4500
ttcctgtctc tgcaaggaac ttgtgggaag tctctaatct caaaagctca gaaaattgac 4560
aaactccccc cagacaaaag aaacctggaa atccagagtt ggagaagcag acagaggcct 4620
gcccctgtcc ccgaagtggc tcgaccaact caccacccag tgtcatctat cacaggacag 4680
gacttcagtg ccagcacccc aaagagctct aaccctccca agaggaagca tgtggaagag 4740
tcagtgctcc acgaacagag atgtggcacg gagtgcatga gggatgactc atgctcaggg 4800
gactcctcag ctcaactctc atccggggag cacctgctgg gacctaacag gataatggcc 4860
tactcaggag gacagactga tatgtgccgg tgctcaaaga ggagcagcca tcgccgatcc 4920
cagagttcac tgaccatatg aggagctgca gaaatctgta caaactcaac agaggccacc 4980
tagtcactgg tccacataac ccttgacccc ttcttcttca tagaggaaac aatgtgccag 5040
tcttattctt ttcttcaaca accttgactt ccatggagga agtgctggcc ccaaggggtc 5100
tgacacaaag acgggaaacc cagtcggcct ctagttttct gctgctctca ggcagcacat 5160
cttgcaaaca gtttggagaa ggaggctgtt tttgttgaat cgagttctca aatcggttta 5220
gaccaaagcc attcttctga ccctctagat aagcgtagcc tacaacccag tgccgtaagt 5280
ttccaagatt caagaagtgt atcaacccag gcaatatctc aggatatgga agtttctggg 5340
tttattta 5348



<210> 60 

<211> 5149 

<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte ID No: 6999183CB1 

<400> 60 

atgagcaaga gacgcatgag cgtgggtcag caaacatggg ctcttctctg caagaactgt 60
ctcaaaaaat ggagaatgaa aagacagacc ttgttggaat ggctcttttc atttcttctg 120
gtactgtttc tgtacctatt tttctccaat ttacatcaag ttcatgacac tcctcaaatg 180
tcttcaatgg atctgggacg tgtagatagt tttaatgata ctaattatgt tattgcattt 240
gcacctgaat ccaaaactac ccaagagata atgaacaaag tggcttcagc cccattccta 300
atggcaggaa gaacaatcat ggggtggcct gatgaaaaaa gcatggatga attggatttg 360
aactattcaa tagacgcagt gagagtcatc tttactgata ccttctccta ccatttgaag 420
ttttcttggg gacatagaat ccccatgatg aaagagcaca gagaccattc agctcactgt 480
caagcagtga atgaaaaaat gaagtgtgaa ggttcagagt tctgggagaa aggctttgta 540
gcttttcaag ctgccattaa tgctgctatc atagaaatcg caacaaatca ttcagtgatg 600
gaacagctga tgtcagttac tggtgtacat atgaagatat taccttttgt tgcccaagga 660
ggagttgcaa ctgatttttt cattttcttt tgcattattt ctttttctac atttatatac 720
tatgtatcag tcaatgttac acaagaaaga caatacatta cgtcattgat gacaatgatg 780
ggactccgag agtcagcatt ctggctttcc tggggtttga tgtatgctgg cttcatcctt 840
atcatggcca ctttaatggc tcttattgta aaatctgcac aaattgtcgt cctgactggt 900
tttgtgatgg tcttcaccct ctttctcctc tatggcctgt ctttgataac tttagctttc 960
ctgatgagtg tgttgataaa gaaacctttc cttacgggct tggttgtgtt tctccttatt 1020
gtcttttggg ggatcctggg attcccagca ttgtatacac atcttcctgc atttttggaa 1080
tggactttgt gtcttcttag cccctttgcc ttcactgttg ggatggccca gcttatacat 1140
ttggactatg atgtgaattc taatgcccac ttggattctt cacaaaatcc atacctcata 1200
atagctactc ttttcatgtt ggtttttgac acccttctgt atttggtatt gacattatat 1260
tttgacaaaa ttttgcccgc tgaatatgga catcgatgtt ctcccttgtt tttcctgaaa 1320
tcctgttttt ggtttcaaca cggaagggct aatcatgtgg tccttgagaa tgaaacagat 1380
tctgatccta cacctaatga ctgttttgaa ccagtgtctc cagaattctg tgggaaggaa 1440
gccatcagaa tcaaaaatct taaaaaagaa tatgcaggga agtgtgagag agtagaagct 1500
ttgaaaggtg tggtgtttga catatatgaa ggccagatca ctgccctcct tggtcacagt 1560
ggagctggaa aaactaccct gttaaacata cttagtgggt tgtcagttcc aacatcaggt 1620
tcagtcactg tctataatca cacactttca agaatggctg atatagaaaa tatcagcaag 1680
ttcactggat tttgtccaca atccaatgtg caatttggat ttctcactgt gaaagaaaac 1740
ctcaggctgt ttgctaaaat aaaagggatt ttgccacatg aagtggagaa agaggtattg 1800
ctattggatg aaccgactgc tggattggat cctctttcaa ggcaccgaat atggaatctc 1860
ctgaaagagg ggaaatcaga cagagtaatt ctcttcagca cccagtttat agatgaggct 1920
gacattctgg cggacaggaa ggtgttcata tccaatggga agctgaagtg tgcaggctct 1980
tctctgttcc ttaagaagaa atggggcata ggctaccatt taagtttgca tctgaatgaa 2040
aggtgtgatc cagagagtat aacatcactg gttaagcagc acatctctga tgccaaattg 2100
acagcacaaa gtgaagaaaa acttgtatat attttgcctt tggaaaggac aaacaaattt 2160
ccagaacttt acagggatct tgatagatgt tctaaccaag gcattgagga ttatggtgtt 2220
tccataacaa ctttgaatga ggtgtttctg aaattagaag gaaaatcaac tattgatgaa 2280
tcagatattg gaatttgggg acaattacaa actgatgggg caaaagatat aggaagcctt 2340
gttgagctgg aacaagtttt gtcttccttc cacgaaacaa ggaaaacaat cagtggcgtg 2400
gcgctctgga ggcagcaggt ctgtgcaata gcaaaagttc gcttcctaaa gttaaagaaa 2460
gaaagaaaaa gcctgtggac tatattattg ctttttggta ttagctttat ccctcaactt 2520
ttggaacatc tattctacga gtcatatcag aaaagttacc cgtgggaact gtctccaaat 2580
acatacttcc tctcaccagg acaacaacca caggatcctc tgacccattt actggtcatc 2640
aataagacag ggtcaaccat tgataacttt ttacattcac tgaggcgaca gaacatagct 2700
atagaagtgg atgcctttgg aactagaaat ggcacagatg acccatctta caatggtgct 2760
atcattgtgt caggtgatga aaaggatcac agattttcaa tagcatgtaa tacaaaacgg 2820
ctgaattgct ttcctgtcct cctggatgtc attagcaatg gactacttgg aatttttaat 2880
tcgtcagaac acattcagac tgacagaagc acattttttg aagagcatat ggattatgag 2940
tatgggtacc gaagtaacac cttcttctgg ataccgatgg cagcctcttt cactccatac 3000
attgcaatga gcagcattgg tgactacaaa aaaaaagctc attcccagct acggatttca 3060
ggcctctacc cttctgcata ctggtttggc caagcactgg tggatgtttc cctgtacttt 3120
ttgatcctcc tgctaatgca aataatggat tatattttta gcccagagga gattatattt 3180
ataattcaaa acctgttaat tcaaatcctg tgtagtattg gctatgtctc atctcttgtt 3240
ttcttgacat atgtgatttc attcattttt cgcaatggga gaaaaaatag tggcatttgg 3300
tcatttttct tcttaattgt ggtcatcttc tcgatagttg ctactgatct aaatgaatat 3360
ggatttctag ggctattttt tggcaccatg ttaatacctc ccttcacatt gattggctct 3420
ctattcattt tttctgagat ttctcctgat tccatggatt acttaggagc ttcagaatct 3480
gaaattgtat acctggcact gctaatacct taccttcatt ttctcatttt tcttttcatt 3540
ctgcgatgcc tagaaatgaa ctgcaggaag aaactaatga gaaaggatcc tgtgttcaga 3600
atttctccaa gaagcaacgc tatttttcca aacccagaag agcctgaagg agaggaggaa 3660
gatatccaga tggaaagaat gagaacagtg aatgctatgg ctgtgcgaga ctttgatgag 3720
acacccgtca tcattgccag ctgtctacgg aaggaatatg caggcaaaaa gaaaaattgc 3780
ttttctaaaa ggaagaaaac aattgccaca agaaatgtct ctttttgtgt taaaaaaggt 3840
gaagttatag gactgttagg acacaatgga gctggtaaaa gtacaactat taagatgata 3900
actggagaca caaaaccaac tgcaggacag gtgattttga aagggagcgg tggaggggaa 3960
cccctgggct tcctggggta ctgccctcag gagaatgcgc tgtggcccaa cctgacagtg 4020
aggcagcacc tggaggtgta cgctgccgtg aaaggtctca ggaaagggga cgcaatgatc 4080
gccatcacac ggttagtgga tgcgctcaag ctgcaggacc agctgaaggc tcccgtgaag 4140
accttgtcag agggaataaa gcgaaagctg tgctttgtgc tgagcatcct ggggaacccg 4200
tcagtggtgc ttctggatga gccgtcgacc gggatggacc ccgaggggca gcagcaaatg 4260
tggcaggtga ttcgggccac ctttagaaac acggagaggg gcgccctcct gaccacccac 4320
tacatggcag aggctgaggc ggtgtgtgac cgagtggcca tcatggtgtc aggaaggctg 4380
agatgtattg gttccatcca acacctgaaa agcaaatttg gcaaagacta cctgctggag 4440
atgaagctga agaacctggc acaaatggag cccctccatg cagagatcct gaggcttttc 4500
ccccaggctg ctcagcagga aaggttctcc tccctgatgg tctataagtt gcctgttgag 4560
gatgtgcgac ctttatcaca ggctttcttc aaattagaga tagttaaaca gagtttcgac 4620
ctggaggagt acagcctctc acagtctacc ctggagcagg ttttcctgga gctctccaag 4680
gagcaggagc tgggtgatct tgaagaggac tttgatccct cggtgaagtg gaaactcctc 4740
ctgcaggaag agccttaaag ctccaatacc ctatatcttt ctttaatcct gtgactcttt 4800
taaagataat attttatagc cttaatatgc cttatatcag aggtggtaca aatgcatttg 4860
aaactcatgc caataattat cctcagtagt atttcttaca gtgagacaca ggcgatgtca 4920
gtgagggcga tcatagggca taagcctaag ccataccatg cagcctttgt gccagcaacc 4980
aatcccatgt ttcctactgt gttaagttta aaaatgcatt tattatagaa ttgtctacat 5040
ttctgaggat gtcatggaga atgcttaatt ttcttcctct gaacttcaaa atattaaata 5100
ttttcttatt tttttgatta aagtataaat taagacaccc tattgactt 5149